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Preface 


A free and open-source calculus 

As we have noted in the single-variable edition of Active Calculus, several fundamental ideas in 
calculus are more than 2000 years old, and calculus as a formal subdiscipline of mathematics was 
first introduced and developed in the late 1600s. Mathematicians agree that the subject has been 
understood - rigorously and by experts - since the mid 1800s. The discipline is one of our great 
human intellectual achievements. 

While each author of a calculus textbook certainly offers their own creative perspective on 
the subject, it is hardly the case that many of the ideas the author presents are new. Indeed, the 
mathematics community broadly agrees on what the main ideas of calculus are, as well as their 
justification and their importance; the core parts of nearly all calculus textbooks are very similar. 
As such, it is our opinion that in the 21st century - an age where the internet permits seamless and 
immediate transmission of information - no one should be required to purchase a calculus text 
to read, to use for a class, or to find a coherent collection of problems to solve. Calculus belongs 
to humankind, not any individual author or publishing company. Thus, a main purpose of this 
work is to present a new multivariable calculus text that is free. In addition, instructors who are 
looking for a multivariable calculus text should have the opportunity to download the source files 
and make modifications that they see fit; thus this text is open-source. 

Any professor or student may use an electronic version of the text for no charge. A .pdf copy 
of the text may be obtained by download from the Active Calculus home page, or directly from 

http://gvsu.edu/ s/Wb. 

Because the text is open-source, any instructor may acquire the full set of source files, by request 
via email to Steve Schlicker. 

This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 
3.0 Unported License. The graphic 
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that appears throughout the text shows that the work is licensed with the Creative Commons, that 
the work may be used for free by any party so long as attribution is given to the author(s), that the 
work and its derivatives are used in the spirit of "share and share alike," and that no party may 
sell this work or any of its derivatives for profit. Full details may be found by visiting 

http://creativecommons.org/licenses/by-nc-sa/3.0/ 

or sending a letter to Creative Commons, 444 Castro Street, Suite 900, Mountain View, California, 
94041, USA. 


Active Calculus - Multivariable: our goals 


In Active Calculus - Multivariable, we endeavor to actively engage students in learning the subject 
through an activity-driven approach in which the vast majority of the examples are completed by 
students. Where many texts present a general theory of calculus followed by substantial collec¬ 
tions of worked examples, we instead pose problems or situations, consider possibilities, and then 
ask students to investigate and explore. Following key activities or examples, the presentation 
normally includes some overall perspective and a brief synopsis of general trends or properties, 
followed by formal statements of rules or theorems. While we often offer plausibility arguments 
for such results, rarely do we include formal proofs. It is not the intent of this text for the instructor 
or author to demonstrate to students that the ideas of calculus are coherent and true, but rather for 
students to encounter these ideas in a supportive, leading manner that enables them to begin to 
understand for themselves why calculus is both coherent and true. 

This approach is consistent with the following goals: 


• To have students engage in an active, inquiry-driven approach, where learners strive to con¬ 
struct solutions and approaches to ideas on their own, with appropriate support through 
questions posed, hints, and guidance from the instructor and text. 

• To build in students intuition for why the main ideas in multivariable calculus are natural 
and true. We strive to accomplish this by using specific cases to highlight the ideas for the 
general situation using contexts that are common and familiar. 

• To challenge students to acquire deep, personal understanding of multivariable calculus 
through reading the text and completing preview activities on their own, through working 
on activities in small groups in class, and through doing substantial exercises outside of class 
time. 

• To strengthen students' written and oral communicating skills by having them write about 
and explain aloud the key ideas of multivariable calculus. 






Features of the Text 
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Similar to the presentation of the single-variable Active Calculus, instructors and students alike 
will find several consistent features in the presentation, including: 

• Motivating Questions. At the start of each section, we list motivating questions that provide 
motivation for why the following material is of interest to us. One goal of each section is to 
answer each of the motivating questions. 

• Preview Activities. Each section of the text begins with a short introduction, followed by 
a preview activity. This brief reading and the preview activity are designed to foreshadow 
the upcoming ideas in the remainder of the section; both the reading and preview activity 
are intended to be accessible to students in advance of class, and indeed to be completed by 
students before a day on which a particular section is to be considered. 

• Activities. Every section in the text contains several activities. These are designed to en¬ 
gage students in an inquiry-based style that encourages them to construct solutions to key 
examples on their own, working either individually or in small groups. 

• Exercises. There are dozens of calculus texts with (collectively) tens of thousands of exer¬ 
cises. Rather than repeat standard and routine exercises in this text, we recommend the use 
of WeBWorK with its access to the National Problem Library and its many multivariable 
calculus problems. In this text, there are a small number of challenging exercises in each sec¬ 
tion. Almost every such exercise has multiple parts, requires the student to connect several 
key ideas, and expects that the student will do at least a modest amount of writing to answer 
the questions and explain their findings. For instructors interested in a more conventional 
source of exercises, consider the freely available text by Gilbert Strang of MIT, available in 
.pdf format from the MIT open courseware site via http : //gvsu . edu/s/bh. 

• Graphics. As much as possible, we strive to demonstrate key fundamental ideas visually, 
and to encourage students to do the same. Throughout the text, we use full-color graphics 
to exemplify and magnify key ideas, and to use this graphical perspective alongside both 
numerical and algebraic representations of calculus. The figures and the software to generate 
them have been created by David Austin. 

• Links to Java Applets. Many of the ideas of multivariable calculus are best understood 
dynamically, and there are a number of applets referenced in the text that can be used by 
instructors and students to assist in the investigations and demonstrations. The use of these 
freely available applets is in accord with our philosophy that no one should be required to 
purchase materials to learn calculus. We are indebted to everyone who allows their expertise 
to be openly shared. 

• Summary of Key Ideas. Each section concludes with a summary of the key ideas encoun¬ 
tered in the preceding section; this summary normally reflects responses to the motivating 
questions that began the section. 
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How to Use this Text 


This text may be used as a stand-alone textbook for a standard multivariable calculus course or as 
a supplement to a more traditional text. 


Electronically 

Because students and instructors alike have access to the book in .pdf format, there are several 
advantages to the text over a traditional print text. One is that the text may be projected on a 
screen in the classroom (or even better, on a whiteboard) and the instructor may reference ideas 
in the text directly, add comments or notation or features to graphs, and indeed write right on 
the text itself. Students can do likewise, choosing to print only whatever portions of the text are 
needed for them. In addition, the electronic version of the text includes live html links to java 
applets, so student and instructor alike may follow those links to additional resources that lie 
outside the text itself. Finally, students can have access to a copy of the text anywhere they have 
an internet-enabled device. 


Activities Workbook 

Each section of the text has a preview activity and at least three in-class activities embedded in the 
discussion. As it is the expectation that students will complete all of these activities, it is ideal for 
them to have room to work on them adjacent to the problem statements themselves. As a separate 
document, we have compiled a workbook of activities that includes only the individual activity 
prompts, along with space provided for students to write their responses. This workbook is the 
one printing expense that students will almost certainly have to undertake, and is available upon 
request to the authors. 

There are also options in the source files for compiling the activities workbook with hints for 
each activity, or even full solutions. These options can be engaged at the instructor's discretion. 


Community of Users 

Because this text is free and open-source, we hope that as people use the text, they will contribute 
corrections, suggestions, and new material. At this time, the best way to communicate typo¬ 
graphical errors or suggestions for changes is by email to Matt Boelkins at boelkinmggvsu . edu. 
At Matt's blog, http: //opencalculus .wordpress .com/, we also share new developments, 
post feedback received by email, and include other points of discussion; readers may post addi¬ 
tional comments and feedback. 
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Chapter 9 

Multivariable and Vector Functions 

9.1 Functions of Several Variables and Three Dimensional Space 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 

• What is a function of several variables? What do we mean by the domain of a function of 
several variables? 

• How do we find the distance between two points in M 3 ? What is the equation of a sphere 
in JR 3 ? 

• What is a trace of a function of two variables? What does a trace tell us about a function? 

• What is a level curve of a function of two variables? What does a level curve tell us about a 
function? 


Introduction 

Throughout our mathematical careers we have studied functions of a single variable. We define a 
function of one variable as a rule that assigns exactly one output to each input. We analyze these 
functions by looking at their graphs, calculating limits, differentiating, integrating, and more. In 
this and following sections, we will study functions whose input is defined in terms of more than 
one variable, and then analyze these functions by looking at their graphs, calculating limits, differ¬ 
entiating, integrating, and more. We will see that many of the ideas from single variable calculus 
translate well to functions of several variables, but we will have to make some adjustments as 
well. 

Preview Activity 9.1. When people buy a large ticket item like a car or a house, they often take 
out a loan to make the purchase. The loan is paid back in monthly installments until the entire 


1 






2 
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amount of the loan, plus interest, is paid. The monthly payment that the borrower has to make 
depends on the amount P of money borrowed (called the principal), the duration t of the loan in 
years, and the interest rate r. For example, if we borrow $18,000 to buy a car, the monthly payment 
M that we need to make to pay off the loan is given by the formula 


M = 


1500r 


1 - 


l 


(!+£) 


T Tt 


The variables r and t are independent of each other, so using functional notation we write 


M (r, t) 


1500r 


1 - 


( 1+ ii) 


Tit 


(a) Find the monthly payments on this loan if the interest rate is 6% and the duration of the 
loan is 5 years. 

(b) Evaluate M (0.05,4). Explain in words what this calculation represents. 

(c) Now consider only loans where the interest rate is 5%. Calculate the monthly payments as 
indicated in Table 9.1. Round payments to the nearest penny. 


Duration (in years) 

2 

3 

4 

5 

6 

Monthly payments (dollars) 







Table 9.1: Monthly payments at an interest rate of 5%. 


(d) Now consider only loans where the duration is 3 years. Calculate the monthly payments 
as indicated in Table 9.2. Round payments to the nearest penny. 


Interest rate 

0.03 

0.05 

0.07 

0.09 

0.11 

Monthly payments (dollars) 







Table 9.2: Monthly payments over three years. 


(e) Describe as best you can the combinations of interest rates and durations of loans that 
result in a monthly payment of $200. 
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Functions of Several Variables 


Suppose we launch a projectile, using a golf club, a cannon, or some other device, from ground 
level. Under ideal conditions (ignoring wind resistance, spin, or any other forces except the force 
of gravity) the horizontal distance the object will travel depends on the initial velocity x the object 
is given, and the angle y at which it is launched. If we let / represent the horizontal distance the 
object travels (its range), then / is a function of the two variables x and y, and we represent / in 
functional notation by 

/<*,y) = 

9 


where g is the acceleration due to gravity. 1 


Definition 9.1. A function / of two independent variables is a rule that assigns to each or¬ 
dered pair (x, y ) in some set D exactly one real number f(x, y). 


There is, of course, no reason to restrict ourselves to functions of only two variables—we can 
use any number of variables we like. For example, 

f(x, y, z) = x 2 — 2 xz + cos (y) 

defines / as a function of the three variables x, y, and z. In general, a function of n independent 
variables is a rule that assigns to an ordered n-tuple (x\,X 2 , ■ • •, x n ) in some set D exactly one real 
number. 

As with functions of a single variable, it is important to understand the set of inputs for which 
the function is defined. 


Definition 9.2. The domain of a function / is the set of all inputs at which the function is 
defined. 


Activity 9.1. 

Identify the domain of each of the following functions. Draw a picture of each domain in the 
x-y plane. 

(a) f(x,y) = x 2 + y 2 

(b) f(x,y) = y/x 2 + y 2 

(c) Q(x,y) = ^ 

(d) s(x,y) = 

<1 


1 We will derive this equation in a later section. 
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Representing Functions of Two Variables 

One of the techniques we use to study functions of one variable is to create a table of values. We 
can do the same for functions of two variables, except that our tables will have to allow us to 
keep track of both input variables. We can do this with a 2-dimensional table, where we list the 
.x-values down the first column and the y-values across the first row. Let / be the function defined 
by f(x, y) = -t that gives the range of a projectile as a function of the initial velocity x and 

launch angle y of the projectile. The value f(x, y) is then displayed in the location where the x row 
intersects the y column, as shown in Table 9.3 (where we measure x in feet and y in radians). 


x\y 

0.2 

0.4 

0.6 

0.8 

1.0 

1.2 

1.4 

25 

7.6 

14.0 

18.2 

19.5 

17.8 

13.2 

6.5 

50 

30.4 

56.0 

72.8 

78.1 

71.0 

52.8 

26.2 

75 

68.4 

126.1 

163.8 

175.7 

159.8 

118.7 

58.9 

100 

121.7 

224.2 

291.3 

312.4 

284.2 

211.1 

104.7 

125 

190.1 

350.3 

455.1 

488.1 

444.0 

329.8 

163.6 

150 

273.8 

504.4 

655.3 

702.8 

639.3 

474.9 

235.5 

175 

372.7 

686.5 

892.0 

956.6 

870.2 

646.4 

320.6 

200 

486.8 

896.7 

1165.0 

1249.5 

1136.6 

844.3 

418.7 

225 

616.2 

1134.9 

1474.5 

1581.4 

1438.5 

1068.6 

530.0 

250 









Table 9.3: Values of f(x, y) = x sv ^ 2y \ 


Activity 9.2. 

Complete the last row in Table 9.3 to provide the needed values of the function /. 

<1 

If / is a function of a single variable x, then we define the graph of / to be the set of points of 
the form (x, / (a;)), where a: is in the domain of /. We then plot these points using the coordinate 
axes in order to visualize the graph. We can do a similar thing with functions of several variables. 
Table 9.3 identifies points of the form (x, y. f(x, y)), and we define the graph of / to be the set of 
these points. 


Definition 9.3. The graph of a function / = f(x, y) is the set of points of the form (x, y, f(x, y)), 
where the point (x, y) is in the domain of /. 


We also often refer to the graph of a function / of two variables as the surface generated by 
/. Points in the form (x, y, f(x, y)) are in three dimensions, so plotting these points takes a bit 
more work than graphs of functions in two dimensions. To plot these three-dimensional points, 
we need to set up a coordinate system with three mutually perpendicular axes - the x-axis, the 
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y-axis, and the z-axis (called the coordinate axes). There are essentially two different ways we could 
set up a 3D coordinate system, as shown in Figures 9.1 and 9.2; thus, before we can proceed, we 
need to establish a convention. 




Figure 9.1: A right hand system 


Figure 9.2: A left hand system 


The distinction between these two figures is subtle, but important. In the coordinate system 
shown in 9.1, imagine that you are sitting on the positive z-axis next to the label “z." Looking 
down at the x- and y-axes, you see that the y-axis is obtained by rotating the x-axis by 90° in the 
counterclockivise direction. Again sitting on the positive z-axis in Figure 9.2, you see that the y-axis 
is obtained by rotating the x-axis by 90° in the clockzvise direction. 

We call the coordinate system in 9.1 a right-hand system ; if we point the index finger of our right 
hand along the positive x-axis and our middle finger along the positive y-axis, then our thumb 
points in the direction of the positive z-axis. Following mathematical conventions, we choose to 
use a right-hand system throughout this book. 

Now that we have established a convention for a right hand system, we can draw a graph of 
the range function defined by /(x, y ) = x S1 ^ 2y \ Note that the function / is continuous in both 
variables, so when we plot these points in the right hand coordinate system, we can connect them 
all to form a surface in 3-space. The graph of the range function / is shown in Figure 9.3. 

There are many graphing tools available for drawing three-dimensional surfaces. 2 Since we 
will be able to visualize graphs of functions of two variables, but not functions of more than two 
variables, we will primarily deal with functions of two variables in this course. It is important to 
note, however, that the techniques we develop apply to functions of any number of variables. 

Notation: We let M 2 denote the set of all ordered pairs of real numbers in the plane (two copies of 
the real number system) and let M 3 represent the set of all ordered triples of real numbers (which 
constitutes three-space). 


2 e.g., Wolfram Alpha and http://web.monroecc.edu/manila/webfiles/calcNSF/JavaCode/ 

CalcPlot3D.htm 
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z 



0 

Figure 9.3: The range surface. 


Some Standard Equations in Three-Space 


In addition to graphing functions, we will also want to understand graphs of some simple equa¬ 
tions in three dimensions. For example, in M 2 , the graphs of the equations x = a and y = b, where 
a and b are constants, are lines parallel to the coordinate axes. In the next activity we consider 
their three-dimensional analogs. 

Activity 9.3. 

(a) Consider the set of points (x, y, z) that satisfy the equation x = 2. Describe this set as 
best you can. 

(b) Consider the set of points (x, y, z) that satisfy the equation y = —1. Describe this set as 
best you can. 

(c) Consider the set of points (x, y, z) that satisfy the equation z = 0. Describe this set as 
best you can. 

<1 


Activity 9.3 shows that the equations where one independent variable is constant lead to 
planes parallel to ones that result from a pair of the coordinate axes. When we make the con¬ 
stant 0, we get the coordinate planes. The xy-plane satisfies z = 0, the .xz-plane satisfies y = 0, and 
the yz-plane satisfies z = 0 (see Figure 9.4). 

On a related note, we define a circle in M 2 as the set of all points equidistant from a fixed point. 
In M 3 , we call the set of all points equidistant from a fixed point a sphere. To find the equation of 
a sphere, we need to understand how to calculate the distance between two points in three-space, 
and we explore this idea in the next activity. 

Activity 9.4. 

Let P = (xo, yo, zq ) and Q = {x\ .y\.z\) be two points in M 3 . These two points form opposite 
vertices of a rectangular box whose sides are planes parallel to the coordinate planes as illus- 
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z 
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X / 

X 




X/ 



Figure 9.4: The coordinate planes. 

trated in Figure 9.5, and the distance between P and Q is the length of the diagonal shown in 
Figure 9.5. 

{x 1 ,y u zi) 



Figure 9.5: The distance formula in M 3 . 


(a) Consider one of the right triangles in the base of the box whose hypotenuse is shown as 
the red line in Figure 9.5. What are the vertices of this triangle? Since this right triangle 
lies in a plane, we can use the Pythagorean Theorem to find a formula for the length of 
the hypotenuse of this triangle. Find such a formula, which will be in terms of xq, ijo, 
x\, and y \. 

(b) Now notice that the triangle whose hypotenuse is the blue segment connecting the 
points P and Q with a leg as the hypotenuse of the triangle found in part (a) lies en¬ 
tirely in a plane, so we can again use the Pythagorean Theorem to find the length of its 
hypotenuse. Explain why the length of this hypotenuse, which is the distance between 
the points P and Q, is 

yj (xi - x 0 ) 2 + (yi - 2/o) 2 + (zi - A)) 2 - 


<1 
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The formula developed in Activity 9.4 is important to remember. 

The distance between points P = (xo,yo,zo) and Q = (x\, y\ , z\ ) (denoted as \PQ\) in M 3 is 
given by the formula 

\PQ\ = v 7 (h - x o) 2 + (yi - yo ) 2 + Oi - z 0 ) 2 . (9.2) 


Equation (9.2) can be used to derive the formula for a sphere centered at a point (xo, yo, zq) 
with radius r. Since the distance from any point (x, y, z) on such a sphere to the point (xo, yo, zq) 
is r, the point (x, y, z ) will satisfy the equation 

y/ix- x 0 ) 2 + (y- yo ) 2 + (z - z 0 ) 2 = r 

Squaring both sides, we come to the standard equation for a sphere. 

The equation of a sphere with center (xo, yo, zo) and radius r is 

(x - X 0 ) 2 + {y- yof + {z- zof = r 2 . 


This makes sense if we compare this equation to its two-dimensional analogue, the equation 
of a circle of radius r in the plane centered at (xo, yo)' 

(x - X 0 ) 2 + (y- yo f = r 2 . 


Traces 

When we study functions of several variables we are often interested in how each individual vari¬ 
able affects the function in and of itself. In Preview Activity 9.1, we saw that the monthly payment 
on an $18,000 loan depends on the interest rate and the duration of the loan. However, if we fix 
the interest rate, the monthly payment depends only on the duration of the loan, and if we set the 
duration the payment depends only on the interest rate. This idea of keeping one variable con¬ 
stant while we allow the other to change will be an important tool for us when studying functions 
of several variables. 

As another example, consider again the range function / defined by 

x 2 sin(2y) 

/(x, y) = - 

9 

where x is the initial velocity of an object in feet per second, y is the launch angle in radians, and g 
is the acceleration due to gravity (32 feet per second squared). If we hold the launch angle constant 
at y = 0.6 radians, we can consider / a function of the initial velocity alone. In this case we have 

x 2 

f(x) = — sin(2 • 0.6). 
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We can plot this curve on the surface by tracing out the points on the surface when y = 0.6, as 
shown in Figure 9.6. The graph and the formula clearly show that / is quadratic in the rr-direction. 
More descriptively, as we increase the launch velocity while keeping the launch angle constant, 
the range increases proportional to the square of the initial velocity. 

Similarly, if we fix the initial velocity at 150 feet per second, we can consider the range as a 
function of the launch angle only. In this case we have 

150 2 sin(2y) 
m = -32-' 

We can again plot this curve on the surface by tracing out the points on the surface when x = 150, 
as shown in Figure 9.7. The graph and the formula clearly show that / is sinusoidal in the y- 
direction. More descriptively, as we increase the launch angle while keeping the initial velocity 
constant, the range is proportional to the sine of twice the launch angle. 




Figure 9.6: The trace with y = 0.6. Figure 9.7: The trace with x = 150. 


The curves we define when we fix one of the independent variables in our two variable func¬ 
tion are called traces. 


Definition 9.4. A trace of a function / of two independent variables x and y is a curve of the 
form z = f(c, y) or z = f(x, c), where c is a constant. 


Understanding trends in the behavior of functions of two variables can be challenging, as can 
sketching their graphs; traces help us with each of these tasks. 

Activity 9.5. 

In the following questions, we investigate the use of traces to better understand a function 
through both tables and graphs. 

(a) Identify the y = 0.6 trace for the range function / defined by f(x, y) = x by 
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highlighting or circling the appropriate cells in Table 9.3. Write a sentence to describe 
the behavior of the function along this trace. 

(b) Identify the x = 150 trace for the range function by highlighting or circling the appro¬ 
priate cells in Table 9.3. Write a sentence to describe the behavior of the function along 
this trace. 



Figure 9.8: Coordinate axes to sketch traces. 

(c) For the function g defined by g(x. y) = x 2 + y 2 + 1, explain the type of function that each 
trace in the x direction will be (keeping y constant). Plot the y = —4 , y = —2, y = 0, 
y = 2, and y = 4 traces in 3-dimensional coordinate system provided in Figure 9.8. 

(d) For the function g defined by g(x, y) = x 2 + y 2 +1, explain the type of function that each 
trace in the y direction will be (keeping x constant). Plot the x = —4, x = —2, x = 0, 
x = 2, and x = 4 traces in 3-dimensional coordinate system in Figure 9.8. 

(e) Describe the surface generated by the function g. 


<\ 


Contour Maps and Level Curves 

We have all seen topographic maps such as the one of the Porcupine Mountains in the upper 
peninsula of Michigan shown in Figure 9.9. 3 The curves on these maps show the regions of con¬ 
stant altitude. The contours also depict changes in altitude: contours that are close together signify 
steep ascents or descents, while contours that are far apart indicate only slight changes in eleva¬ 
tion. Thus, contour maps tell us a lot about three-dimensional surfaces. Mathematically, if f(x, y) 
represents the altitude at the point (x, y), then each contour is the graph of an equation of the form 
f(x, y) = k, for some constant k. 

3 Map source: Michigan Department of Natural Resources, https : //www .michigan. gov/dnr/0, 4570, 

7-153-10369_46675_58093-, 00 . html, with permission of the Michigan DNR and Bob Wild. A picture of 

Lake of the Clouds (fro Lars Jensen) in shown on the cover. 
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Figure 9.9: Contour map of the Porcupine Mountains. 
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Activity 9.6. 

On the topographical map of the Porcupine Mountains in Figure 9.9, 

(a) identify the highest and lowest points you can find; 

(b) from a point of your choice, determine a path of steepest ascent that leads to the highest 
point; 

(c) from that same initial point, determine the least steep path that leads to the highest 
point. 
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Definition 9.5. A level curve (or contour) of a function / of two independent variables x and 
y is a curve of the form k = f(x. y ), where A; is a constant. 


Topographical maps can be used to create a three-dimensional surface from the two-dimensional 
contours or level curves. For example, level curves of the range function defined by f(x, y) = 

X S 32 2y ) plotted in the xy -plane are shown in Figure 9.10. If we lift these contours and plot them 
at their respective heights, then we get a picture of the surface itself, as illustrated in Figure 9.11. 



Z 



Figure 9.11: Level curves at the ap¬ 
propriate height. 


The use of level curves and traces can help us construct the graph of a function of two variables. 

Activity 9.7. 

(a) Let f(x,y) = x 2 + y 2 . Draw the level curves f(x,y) = k for k = 1, k = 2, k = 3, and 
k = 4 on the left set of axes given in Figure 9.12. (You decide on the scale of the axes.) 
Explain what the surface defined by / looks like. 
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Figure 9.12: Left: Level curves for f(x, y) = x 2 + y 2 . Right: Level curves for g(x, y) = \J x 2 + y 2 . 

(b) Let g(x, y) = \/x 2 + y 2 . Draw the level curves g(x, y) = k for k = 1, k = 2, k = 3, and 
k = 4 on the right set of axes given in Figure 9.12. (You decide on the scale of the axes.) 
Explain what the surface defined by g looks like. 

(c) Compare and contrast the graphs of / and g. How are they alike? How are they differ¬ 
ent? Use traces for each function to help answer these questions. 


<1 

The traces and level curves of a function of two variables are curves in space. In order to 
understand these traces and level curves better, we will first spend some time learning about 
vectors and vector-valued functions in the next few sections and return to our study of functions 
of several variables once we have those more mathematical tools to support their study. 


A gallery of functions 

We end this section by considering a collection of functions and illustrating their graphs and some 
level curves. 





Figure 9.13: z = x 2 + y 2 
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Figure 9.14: z = 4 — (x 2 + y 2 ) 




Figure 9.15: z = \/ x 2 + y 2 
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Figure 9.17: z = sin(x) + sin(y) 
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Figure 9.18: z = y 2 — x 3 + x 





Figure 9.19: z 


xye 


-x 2 -y 


2 


Summary 

• A function / of several variables is a rule that assigns a unique number to an ordered collection 
of independent inputs. The domain of a function of several variables is the set of all inputs for 
which the function is defined. 

• In M 3 , the distance between points P = (xo> 2/o> A)) an d Q = ( x i,yi,zi) (denoted as \PQ\) is 
given by the formula 

\PQ\ = y 7 (-u - z 0 ) 2 + (yi - yo ) 2 + (u - z 0 ) 2 . 

and thus the equation of a sphere with center (xo,yo, zo) and radius r is 

{x - x 0 ) 2 + {y - yo) 2 + (z- z 0 ) 2 = r 2 . 

• A trace of a function / of two independent variables x and y is a curve of the form z = f(c, y) 
or z = f(x, c), where c is a constant. A trace tells us how the function depends on a single 
independent variable if we treat the other independent variable as a constant. 
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• A level curve of a function / of two independent variables x and y is a curve of the form 
k = f(x,y), where A; is a constant. A level curve describes the set of inputs that lead to a 
specific output of the function. 


Exercises 

1. Find the equation of each of the following geometric objects. 

(a) The plane parallel to the x-y plane that passes through the point (—4, 5, —12). 

(b) The plane parallel to the y-z plane that passes through the point (7, —2, —3). 

(c) The sphere centered at the point (2,1,3) and has the point (—1,0, —1) on its surface. 

(d) The sphere whose diameter has endpoints (—3,1, —5) and (7,9, —1). 


2 . 


The Ideal Gas Law, PV = RT, relates the pressure ( P , in pascals), temperature (T, in Kelvin), 
and volume (V, in cubic meters) of 1 mole of a gas (R = 8.314 ^ K is the universal gas con¬ 

stant), and describes the behavior of gases that do not liquefy easily, such as oxygen and hy¬ 
drogen. We can solve the ideal gas law for the volume and hence treat the volume as a function 
of the pressure and temperature: 


V(P,T) 


8.314T 

P 


(a) Explain in detail what the trace of V with P = 1000 tells us about a key relationship 
between two quantities. 

(b) Explain in detail what the trace of V with T = 5 tells us. 

(c) Explain in detail what the level curve V = 0.5 tells us. 

(d) Use 2 or three additional traces in each direction to make a rough sketch of the surface 
over the domain of V where P and T are each nonnegative. Write at least one sentence 
that describes the way the surface looks. 

(e) Based on all your work above, write a couple of sentences that describe the effects that 
temperature and pressure have on volume. 

3. Consider the function h defined by h(x. y) = 8 - y^4 — x 2 — y 2 . 

(a) What is the domain of h ? (Hint: describe a set of ordered pairs in the plane by explain¬ 
ing their relationship relative to a key circle.) 

(b) The range of a function is the set of all outputs the function generates. Given that the 
range of the square root function g(t ) = \ft is the set of all nonnegative real numbers, 
what do you think is the range of hi Why? 

(c) Choose 4 different values from the range of h and plot the corresponding level curves 
in the plane. What is the shape of a typical level curve? 
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(d) Choose 5 different values of x (including at least one negative value and zero), and 
sketch the corresponding traces of the function h. 

(e) Choose 5 different values of y (including at least one negative value and zero), and 
sketch the corresponding traces of the function h. 

(f) Sketch an overall picture of the surface generated by h and write at least one sentence 
to describe how the surface appears visually. Does the surface remind you of a familiar 
physical structure in nature? 
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9.2 Vectors 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• What is a vector? 

• What does it mean for two vectors to be equal? 

• How do we add two vectors together and multiply a vector by a scalar? 

• How do we determine the magnitude of a vector? What is a unit vector, and how do we 
find a unit vector in the direction of a given vector? 


Introduction 

If we are at a point x in the domain of a function of one variable, there are only two directions in 
which we can move: in the positive or negative /--direction. If, however, we are at a point (x, y) in 
the domain of a function of two variables, there are many directions in which we can move. Thus, 
it is important for us to have a means to indicate direction, and we will do so using vectors. 

Preview Activity 9.2. After working out, Sarah and John leave the Recreation Center on the Grand 
Valley State University Allendale campus (a map of which is given in Figure 9.20) to go to their 
next classes . 4 Suppose we record Sarah's movement on the map in a pair (x, y) (we will call this 
pair a vector), where x is the horizontal distance (in feet) she moves (with east as the positive 
direction) and y as the vertical distance (in feet) she moves (with north as the positive direction). 
We do the same for John. Throughout, use the legend to estimate your responses as best you can. 

(a) What is the vector vi = (x. y) that describes Sarah's movement if she walks directly in a 
straight line path from the Recreation Center to the entrance at the northwest end of Mack¬ 
inac Hall? (Assume a straight line path, even if there are buildings in the way.) Explain 
how you found this vector. What is the total distance in feet between the Recreation Center 
and the entrance to Mackinac Hall? Measure the number of feet directly and then explain 
how to calculate this distance in terms of x and y. 

(b) What is the vector V 2 = (x, y) that describes John's change in position if he walks directly 
from the Recreation Center to Au Sable Hall? How many feet are there between Recreation 
Center to Au Sable Hall in terms of x and y? 

(c) What is the vector V 3 = (x. y) that describes the change in position if John walks directly 
from Au Sable Hall to the northwest entrance of Mackinac Hall to meet up with Sarah after 

4 GVSU campus map from http : / /www . gvsu . edu/homepage/f iles/pdf/maps/allendale . pdf, used with 
permission from GVSU, credit to illustrator Chris Bessert. 
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class? What relationship do you see among the vectors vi, V 2 , and V 3 ? Explain why this 
relationship should hold. 


CXI 



GrandWuhy 
State University. 


Allendale Campus 



Figure 9.20: Grand Valley State University Allendale campus map. 
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Representations of Vectors 

Preview Activity 9.2 shows how we can record the magnitude and direction of a change in po¬ 
sition using an ordered pair of numbers (x,y). There are many other quantities, such as force 
and velocity, that possess the attributes of magnitude and direction, and we will call each such 
quantity a vector. 


Definition 9.6. A vector is any quantity that possesses the attributes of magnitude and direc¬ 
tion. 


We can represent a vector geometrically as a directed line segment, with the magnitude as the 
length of the segment and an arrowhead indicating direction, as shown in Figure 9.21. 




Figure 9.21: A vector. 


Figure 9.22: Representations of the same vector. 


According to the definition, a vector possesses the attributes of length (magnitude) and direc¬ 
tion; the vector's position, however, is not mentioned. Consequently, we regard as equal any two 
vectors having the same magnitude and direction, as shown in Figure 9.22. 


Two vectors are equal provided they have the same magnitude and direction. 


This means that the same vector may be drawn in the plane in many different ways. For 
instance, suppose that we would like to draw the vector (3,4), which represents a horizontal 
change of three units and a vertical change of four units. We may place the tail of the vector (the 
point from which the vector originates) at the origin and the tip (the terminal point of the vector) 
at (3,4), as illustrated in Figure 9.23. A vector with its tail at the origin is said to be in standard 
position. 

Alternatively, we may place the tail of the vector (3,4) at another point, such as Q( 1,1). After 
a displacement of three units to the right and four units up, the tip of the vector is at the point 
R( 4,5) (see Figure 9.24). 
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Figure 9.23: A vector in standard position Figure 9.24: A vector between two points 


In this example, the vector led to the directed line segment from Q to R, which we denote as 
Q fl. We may also turn the situation around: given the two points Q and R, we obtain the vector 
(3,4) because we move horizontally three units and vertically four units to get from Q to R. In 
other words, QII = (3,4). In general, the vector QII from the point Q = (q \. qp) to R = (r,, r 2 ) is 
found by taking the difference of coordinates, so that 

Q& = (fi - qi,r 2 - q 2 ). 


We will use boldface letters to represent vectors, such as v = (3,4), to distinguish them from 
scalars. The entries of a vector are called its components ; in the vector (3,4), the x component is 3 
and the y component is 4. We use pointed brackets ( ,) and the term components to distinguish a 
vector from a point ( ,) and its coordinates. There is, however, a close connection between vectors 
and points. Given a point P, we will frequently consider the vector OP from the origin 0 to P. 
For instance, if P = (3,4), then 01^ = (3,4) as in Figure 9.25. In this way, we think of a point P as 
defining a vector OP whose components agree with the coordinates of P. 



Figure 9.25: A point defines a vector 


While we often illustrate vectors in the plane since it is easier to draw pictures, different sit¬ 
uations call for the use of vectors in three or more dimensions. For instance, a vector v in n- 
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dimensional space, M n , has n components and may be represented as 

V = (vi,v 2 ,v 3 ,.. .,v n ). 

The next activity will help us to become accustomed to vectors and operations on vectors in 
three dimensions. 

Activity 9.8. 

As a class, determine a coordinatization of your classroom, agreeing on some convenient set of 
axes (e.g., an intersection of walls and floor) and some units in the x, y, and z directions (e.g., 
using lengths of sides of floor, ceiling, or wall tiles). Let O be the origin of your coordinate 
system. Then, choose three points. A, B, and C in the room, and complete the following. 

(a) Determine the coordinates of the points A, B, and C. 

(b) Determine the components of the indicated vectors. 

(i) ol (ii) (iii) ot (iv) A& (v) A$ (vi) B& 
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Equality of Vectors 

Because location is not mentioned in the definition of a vector, any two vectors that have the same 
magnitude and direction are equal. It is helpful to have an algebraic way to determine when this 
occurs. That is, if we know the components of two vectors u and v, we will want to be able to 
determine algebraically when u and v are equal. There is an obvious set of conditions that we use. 

Two vectors u = (u\,u 2 ) and v = (vi,v 2 ) in M 2 are equal if and only if their corresponding 
components are equal: u\ = v\ and u 2 = v 2 . More generally, two vectors u = (ui,u 2 ,..., u n ) 
and v = (v\,v 2 ,, v n ) in M n are equal if and only if m = v t for each possible value of i. 


Operations on Vectors 

Vectors are not numbers, but we can now represent them with components that are real numbers. 
As such, we naturally wonder if it is possible to add two vectors together, multiply two vectors, 
or combine vectors in any other ways. In this section, we will study two operations on vectors: 
vector addition and scalar multiplication. To begin, we investigate a natural way to add two 
vectors together, as well as to multiply a vector by a scalar. 

Activity 9.9. 

Let u = (2,3), v = (-1,4). 

(a) Using the two specific vectors above, what is the natural way to define the vector sum 
u +v? 
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(b) In general, how do you think the vector sum a+b of vectors a = (ai, < 22 ) and b = ( 61 , 62 ) 
in M 2 should be defined? Write a formal definition of a vector sum based on your 
intuition. 

(c) In general, how do you think the vector sum a + b of vectors a = ( 01 , 02 , 03 ) and 
b = ( 61 , 62 ,^ 3 ) in M 3 should be defined? Write a formal definition of a vector sum 
based on your intuition. 

(d) Returning to the specific vector v = (—1,4) given above, what is the natural way to 
define the scalar multiple y v? 

(e) In general, how do you think a scalar multiple of a vector a = (ai, 02 ) in M 2 by a scalar 
c should be defined? how about for a scalar multiple of a vector a = (oi, 02,03) in M 3 
by a scalar c? Write a formal definition of a scalar multiple of a vector based on your 
intuition. 
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We can now add vectors and multiply vectors by scalars, and thus we can add together scalar 
multiples of vectors. This allows us to define vector subtraction, v — u, as the sum of v and —1 times 
u, so that 

v — u = v + (—l)u. 

Using vector addition and scalar multiplication, we will often represent vectors in terms of the 
special vectors i = (1,0) and j = (0,1). For instance, we can write the vector (a, 6) in M 2 as 

(a, 6) = a( 1,0) + 6(0,1) = ai + 6j, 

which means that 

(2,-3) = 2i-3j. 

In the context of M 3 , we let i = (1,0,0), j = (0,1,0), and k = (0,0,1), and we can write the vector 

(a, 6 , c) in M 3 as 

(a, 6, c) = a(l, 0, 0) + 6(0,1, 0) + c(0,0,1) = ai + 6j + ck. 

The vectors i, j, and k are called the standard unit vectors 5 , and are important in the physical sci¬ 
ences. 


Properties of Vector Operations 

We know that the scalar sum 1 + 2 is equal to the scalar sum 2 + 1 . This is called the commutative 
property of scalar addition. Any time we define operations on objects (like addition of vectors) we 
usually want to know what kinds of properties the operations have. For example, is addition of 
vectors a commutative operation? To answer this question we take two arbitrary vectors v and u 

5 As we will learn momentarily, unit vectors have length 1. 
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and add them together and see what happens. Let v = (v±, V 2 ) and u = {u\,U 2 )- Now we use the 
fact that v\, 02, u \ , and 112 are scalars, and that the addition of scalars is commutative to see that 

V + U = (vi, V2) + (ui,U 2 ) = (Vl +Ul,V2+ U2) = (Ul + V±,U 2 + V 2 ) = (^ 1 , ^2) + {vi,V 2 ) = U + V. 

So the vector sum is a commutative operation. Similar arguments can be used to show the follow¬ 
ing properties of vector addition and scalar multiplication. 

Let v, u, and w be vectors in M" and let a and b be scalars. Then 

1. v + u = u + v 

2. (v + u) + w = v + (u + w) 

3. The vector 0 = (0, 0,..., 0) has the property that v + 0 = v. The vector 0 is called the 

zero vector. 

4. (—l)v + v = 0. The vector (—l)v = —v is called the additive inverse of the vector v. 

5. (a + 6)v = av + 6v 

6. a(v + u) = av + au 

7. (a6)v = a(6v) 

8. lv = v. 


We verified the first property for vectors in M 2 ; it is straightforward to verify that the rest of 
the eight properties just noted hold for all vectors in W 1 . 

Geometric Interpretation of Vector Operations 

Next, we explore a geometric interpretation of vector addition and scalar multiplication that al¬ 
lows us to visualize these operations. Let u = (4, 6) and v = (3, —2). Then w = u + v = (7,4), as 
shown on the left in Figure 9.26. 

If we think of these vectors as displacements in the plane, we find a geometric way to envision 
vector addition. For instance, the vector u + v will represent the displacement obtained by follow¬ 
ing the displacement u with the displacement v. We may picture this by placing the tail of v at the 
tip of u, as seen in the center of Figure 9.26. 

Of course, vector addition is commutative so we obtain the same sum if we place the tail of u 
at the tip of v. We therefore see that u+ v appears as the diagonal of the parallelogram determined 
by u and v, as shown on the right of Figure 9.26. 

Vector subtraction has a similar interpretation. On the left in Figure 9.27, we see vectors u, v, 
and w = u + v. If we rewrite v = w — u, we have the arrangement of Figure 9.28. In other words, 
to form the difference w — u, we draw a vector from the tip of u to the tip of w. 
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Figure 9.26: A vector sum (left), summing displacements (center), the parallelogram law (right) 




Figure 9.27: Vector addition Figure 9.28: Vector subtraction 


In a similar way, we may geometrically represent a scalar multiple of a vector. For instance, 
if v = (2,3), then 2v = (4, 6). As shown in Figure 9.29, multiplying v by 2 leaves the direction 
unchanged, but stretches v by 2. Also, —2v = (—4, —6), which shows that multiplying by a 
negative scalar gives a vector pointing in the opposite direction of v. 
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Figure 9.29: Scalar multiplication 


Activity 9.10. 




Figure 9.30 Figure 9.31 

Suppose that u and v are the vectors shown in Figure 9.30. 

(a) On Figure 9.30, sketch the vectors u + v, v — u, 2u, —2u, and —3v. 

(b) What is Ov? 

(c) On Figure 9.31, sketch the vectors —3v, — 2v, — lv, 2v, and 3v. 

(d) Give a geometric description of the set of vectors tv where t is any scalar. 

(e) On Figure 9.31, sketch the vectors u — 3v, u — 2v, u — v, u + v, and u + 2v. 

(f) Give a geometric description of the set of vectors u + tv where t is any scalar. 
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The Magnitude of a Vector 

By definition, vectors have both direction and magnitude (or length). We now investigate how to 
calculate the magnitude of a vector. Since a vector v can be represented by a directed line segment. 
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we can use the distance formula to calculate the length of the segment. This length is the magnitude 
of the vector v and is denoted |v|. 

Activity 9.11. 




Vi 


Figure 9.32: The vector defined by A and B. 


Figure 9.33: An arbitrary vector, v. 


(a) Fet A = (2, 3) and B = (4, 7), as shown in Figure 9.32. Compute \A1^\. 

(b) Fet v = {v\,V 2 ) be the vector in IR 2 with components v\ and v -2 as shown in Figure 9.33. 
Use the distance formula to find a general formula for |v|. 

(c) Fet v = vg) be a vector in M 3 . Use the distance formula to find a general formula 

for |v|. 

(d) Suppose that u = (2,3) and v = (—1,2). Find |u|, |v|, and |u + v|. Is it true that 
|u + v| = |u| + |v|? 

(e) Under what conditions will |u + v| = |u| + |v|? (Hint: Think about how u, v, and u + v 
form the sides of a triangle.) 

(f) With the vector u = (2,3), find the lengths of 2u, 3u, and —2u, respectively, and use 
proper notation to label your results. 

(g) If t is any scalar, how is \tu\ related to |u|? 

(h) A unit vector is a vector whose magnitude is 1. Of the vectors i, j, and i + j, which are 
unit vectors? 

(i) Find a unit vector v whose direction is the same as u = (2, 3). (Hint: Consider the result 
of part (g).) 
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Summary 

• A vector is any object that possesses the attributes of magnitude and direction. Examples of 
vector quantities are position, velocity, acceleration, and force. 
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• Two vectors are equal if they have the same direction and magnitude. Notice that position is 
not considered, so a vector is independent of its location. 

• If u = (ui,U 2 , • • •, u n ) and v = (v\,V 2 , ■ ■ ■, v n ) are two vectors in M n , then their vector sum is 
the vector 


u v v = (m +Vl,U 2 + V 2 ,...,U n + V n }. 


If u = (ui, U2, ■ ■ ■, u n ) is a vector in M n and c is a scalar, then the scalar multiple cu is the vector 


CU = (cUl,CU2, • ■ • ,CU n ). 


• The magnitude of the vector v = (v\,V2,- ■ ■, v n ) in M n is the scalar 



A vector u is a unit vector provided that |u| = 1. If v is a nonzero vector, then the vector ^ is 
a unit vector with the same direction as v. 


Exercises 

1. Let v = (1, —2), u = (0,4), and w = (—5, 7). 

(a) Determine the components of the vector u — v. 

(b) Determine the components of the vector 2v — 3u. 

(c) Determine the components of the vector v + 2u — 7w. 

(d) Determine scalars a and b such that av + bu = w. 

2. Let u = (2,1) and v = (1, 2). 

(a) Determine the components and draw geometric representations of the vectors 2u, \u, 
(—l)u, and (—3)u on the same set of axes. 

(b) Determine the components and draw geometric representations of the vectors u + v, 
u + 2v, and u + 3v. 

(c) Determine the components and draw geometric representations of the vectors u — v, 
u — 2v, and u — 3v. 

(d) Recall that u — v = u + (—l)v. Use the "tip to tail" perspective for vector addition to 
explain why the difference u — v can be viewed as a vector that points from the tip of v 
to the tip of u. 

3. Recall that given any vector v, we can calculate its length, |v|. Also, we say that two vectors 
that are scalar multiples of one another are parallel. 
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(a) Let v = (3,4) in M 2 . Compute | v|, and determine the components of the vector u = j^v. 
What is the magnitude of the vector u? How does its direction compare to v? 

(b) Let w = 3i — 3j in M 2 . Determine a unit vector u in the same direction as w. 

(c) Let v = (2, 3, 5) in M 3 . Compute |v|, and determine the components of the vector u = 
mV. What is the magnitude of the vector u? How does its direction compare to v? 

(d) Let v be an arbitrary nonzero vector in M 3 . Write a general formula for a unit vector 
that is parallel to v. 
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9.3 The Dot Product 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• How is the dot product of two vectors defined and what geometric information does it tell 
us? 

• How can we tell if two vectors in M n are perpendicular? 

• How do we find the projection of one vector onto another? 


Introduction 

In the last section, we considered vector addition and scalar multiplication and found that each 
operation had a natural geometric interpretation. In this section, we will introduce a means of 
multiplying vectors. 

Preview Activity 9.3. For two-dimensional vectors u = (ui,U 2 ) and v = (v \, v-i), the dot product 
is simply the scalar obtained by 

U • V = U\V\ + U2V2- 


(a) If u = (3,4) and v = (—2,1), find the dot product u • v. 

(b) Find i • i and i • j. 

(c) If u = (3,4), find u • u. How is this related to |u|? 

(d) On the axes in Figure 9.34, plot the vectors u = (1, 3) and v = (—3, 1). Then, find u • v. 
What is the angle between these vectors? 
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Figure 9.34: For part (d) 
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Figure 9.35: For part (e) 


(e) On the axes in Figure 9.35, plot the vector u = (1,3). 

For each of the following vectors v, plot the vector on Figure 9.35 and then compute the 
dot product u • v. 

. v = (3,2). 

• v = (3,0). 

• v = (3,-1). 

. v = (3, —2). 

• v = (3, —4). 

(f) Based upon the previous part of this activity, what do you think is the sign of the dot 
product in the following three cases shown in Figure 9.36? 




Figure 9.36: For part (f) 
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The Dot Product 

If we have two n-dimensional vectors u = (m, 112, ■ ■ ■ , u n ) and v = (v\, V2, ■ ■ ■, v n ), we define their 
dot product 6 as 

u • v = U\V\ + U2V2 + ... + u n v n . 

For instance, we find that 

(3,0,1) • (-2,1,4) = 3 • (-2) + 0 • 1 + 1 • 4 = —6 + 0 + 4 = —2. 

Notice that the resulting quantity is a scalar. Our work in Preview Activity 9.3 examined dot 
products of two-dimensional vectors. 

Activity 9.12. 

Determine each of the following. 

(a) <1,2, -3) - <4, -2,0). 

(b) (0,3,-2,1)-(5,-6,0,4) 

<1 

The dot product is a natural way to define a product of two vectors. In addition, it behaves in 
ways that are similar to the product of, say, real numbers. 

Let u, v, and w be vectors in R n . Then 

(a) u ■ v = v ■ u (the dot product is commutative), and 

(b) if c is a scalar, then (cu + v) • w = c(u ■ w) + v ■ w (the dot product is bilinear), 


Moreover, the dot product gives us valuable geometric information about the vectors and their 
relative orientation. For instance, let's consider what happens when we dot a vector with itself: 

U • U = (UI,U2, ...,Un)' {ui,U 2 , ■ ■ ■ , U n ) = u\ + u\ + . . . + = |u| 2 . 

In other words, the dot product of a vector with itself gives the square of the length of the vector: 
u • u = |u| 2 . 

The angle between vectors 

The dot product can help us understand the angle between two vectors. For instance, if we are 
given two vectors u and v, there are two angles that these vectors create, as depicted in Figure 
9.37. We will call 6, the smaller of these angles, the angle between these vectors. Notice that 9 lies 
between 0 and 7 r. 

6 As we will see shortly, the dot product arises in physics to calculate the work done by a vector force in a given direc¬ 
tion. It might be more natural to define the dot product in this context, but it is more convenient from a mathematical 
perspective to define the dot product algebraically and then view work as an application of this definition. 
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Figure 9.37: The angle between vectors u 
and v. 



Figure 9.38: The triangle formed by u, v, and 
u — v. 


To determine this angle, we may apply the Faw of Cosines to the triangle shown in Figure 9.38. 

Using the fact that the dot product of a vector with itself gives us the square of its length, 
together with the bilinearity of the dot product, we find: 

|u — v | 2 = |u | 2 + |v | 2 - 2 |u||v| cos(< 9 ) 

(u — v) • (u — v) = u • u + v • v — 21u| | v| cos (6) 

u • (u — v) — v • (u — v) = u • u + v • v — 21u| | v| cos($) 

u u — 2u • v + v • v = u • u + v • v — 21u| |v| cos(6) 

— 2 u • v = — 2 1 u| | v| cos(9) 

u-v = |u||v| cos(0). 

To summarize, we have the important relationship 

U • V = U\V\ + U 2 V 2 + ... + u n v n = |u| | v| cos (9). ( 9 . 3 ) 


It is sometimes useful to think of Equation (9.3) as giving us an expression for the angle be¬ 
tween two vectors: 


6 = cos 


-i 


U • V 


U V 


The real beauty of this expression is this: the dot product is a very simple algebraic operation to 
perform yet it provides us with important geometric information - namely the angle between the 
vectors - that would be difficult to determine otherwise. 


Activity 9.13. 

Determine each of the following. 

(a) The length of the vector u = (1, 2, —3) using the dot product. 

(b) The angle between the vectors u = (1,2) and v = (4,-1) to the nearest tenth of a 
degree. 


@©@© 






9.3. THE DOT PRODUCT 


35 


(c) The angle between the vectors y = (1,2, —3) and z = (—2,1,1) to the nearest tenth of a 


degree. 


(d) If the angle between the vectors u and v is a right angle, what does the expression 


u • v = |u| | v| cos 6 say about their dot product? 

(e) If the angle between the vectors u and v is acute—that is, less than it/2 —what does the 


expression u • v = |u| | v| cos 6 say about their dot product? 

(f) If the angle between the vectors u and v is obtuse—that is, greater than tt /2—what does 
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The Dot Product and Orthogonality 

When the angle between two vectors is a right angle, it is frequently the case that something 
important is happening. In this case, we say the vectors are orthogonal. For instance, orthogonality 
often plays a role in optimization problems; to determine the shortest path from a point in M 3 to a 
given plane, we move along a line orthogonal to the plane. 

As Activity 9.13 indicates, the dot product provides a simple means to determine whether two 
vectors are orthogonal to one another. In this case, u • v = |u||v| cos(7r/2) = 0, so we make the 
following important observation. 

Two vectors u and v in M rt are orthogonal to each other if u • v = 0. 


More generally, the sign of the dot product gives us useful information about the relative ori¬ 
entation of the vectors. If we remember that 


cos(0) >0 if 9 is an acute angle, 
cos(0) =0 if 6 is a right angle. 


and cos(0) <0 if 9 is an obtuse angle. 


we see that 


u • v > 0 if 6 is an acute angle, 
u • v = 0 if 0 is a right angle. 


and u • v < 0 if 6 is an obtuse angle. 


This is illustrated in Figure 9.39. 
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Figure 9.39: The orientation of vectors 


Work, Force, and Displacement 

In physics, work is a measure of the energy required to apply a force to an object through a dis¬ 
placement. For instance. Figure 9.40 shows a force F displacing an object from point A to point B. 
The displacement is then represented by the vector Afi. 


F 



Figure 9.40: A force F displacing an object. 

It turns out that the work required to displace the object is 

W = F-Aj3 = |F||I^| cos(0). 

This means that the work is determined only by the magnitude of the force applied parallel to the 
displacement. Consequently, if we are given two vectors u and v, we would like to write u as a 
sum of two vectors, one of which is parallel to v and one of which is orthogonal to v. We take up 
this task after the next activity. 

Activity 9.14. 

Determine the work done by a 25 pound force acting at a 30° angle to the direction of the 
object's motion, if the object is pulled 10 feet. In addition, is more work or less work done if the 
angle to the direction of the object's motion is 60°? 
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Projections 




Figure 9.41: The projection of u onto v. Figure 9.42: proj v u when ()>?>. 

Suppose we are given two vectors u and v as shown in Figure 9.41. Motivated by our discus¬ 
sion of work, we would like to write u as a sum of two vectors, one of which is parallel to v and 
one of which is orthogonal. That is, we would like to write 

u = proj v u + proj ±v u, (9.4) 

where proj v u is parallel to v and proj v u is orthogonal to v. We call the vector proj v u the projection 
of u onto v. 

To find the vector proj v u, we will dot both sides of Equation (9.4) with the vector v, to find that 

u • v = (proj v u + proj ±v u) • v 

= (P r °j v u) • v + (proj ±v u) • v 
= (P ro j v u ) ' v - 


Notice that (proj v u) • v = 0 since proj v u is orthogonal to v. Also, proj v u must be a scalar 
multiple of v since it is parallel to v, so we will write proj v u = sv. It follows that 

u ■ v = (proj v u) • v = sv • v, 


which means that 


and hence 


u • v 

s = - 

v • v 


proj v u 


u • v 
-v = 

V • V 


U • V 



It is sometimes useful to write proj v u as a scalar times a unit vector in the direction of v. We 
call this scalar the component of u along v and denote it as comp v u. We therefore have 


P ro )v u = 


u • v 


u v v 


-V = 


v 


—r = comp U:—- 
v v 
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so that 


comp v u = 


U • V 


Let u and v be vectors in M n . The component of u in the direction of v is the scalar 

u • v 

comp v u = —, 

and the projection of u onto v is the vector 

u • v 

p r °j v u = -v. 

V • V 


Moreover, since 

u = proj v u + proj ±v u, 

it follows that 

p r °jj_v u = u - P r °jv u - 

This shows that once we have computed proj v u, we can find proj v u simply by calculating the 
difference of two known vectors. 

Activity 9.15. 

Let u = (2,6) and v = (4,-8). Lind comp v u, proj v u and proj ±v u, and draw a picture to 
illustrate. Linally, express u as the sum of two vectors where one is parallel to v and the other 
is perpendicular to v. 
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Summary 

• The dot product of two vectors in M n , u = (ui,U 2 , • • •, u n ) and v = (v±, V 2 , ■ ■ ■, v n ), is the scalar 

u ■ V = U\V\ + U 2 V2 H-1- u n v n . 


• The dot product is related to the length of a vector since u • u = |u| 2 . 

• The dot product provides us with information about the angle between the vectors since 

u • v = |u| |v| cos(0), 
where 6 is the angle between u and v. 

• Two vectors are orthogonal if the angle between them is 7r/2. In terms of the dot product, the 
vectors u and v are orthogonal if and only if u • v = 0. 

• The projection of a vector u in M n onto a vector v in M n is the vector 



proj v u 


u • v 

-v. 

v ■ v 
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Exercises 

1. Let v = (—2, 5) in M 2 , and let y = (0, 3, —2) in M 3 . 

(a) Is (2, —1) perpendicular to v? Why or why not? 

(b) Find a unit vector u in M 2 such that u is perpendicular to v. How many such vectors 
are there? 

(c) Is (2, —1, —2) perpendicular to y? Why or why not? 

(d) Find a unit vector winl 3 such that w is perpendicular to y. How many such vectors 
are there? 

(e) Let z = (2,1,0). Find a unit vector r in M 3 such that r is perpendicular to both y and z. 
How many such vectors are there? 

2. Consider the triangle in M 3 given by P(3, 2, —1), Q(l, —2,4), and R( 4,4,0). 

(a) Find the measure of each of the three angles in the triangle, accurate to 0.01 degrees. 

(b) Choose two sides of the triangle, and call the vectors that form the sides (emanating 
from a common point) a and b. 

i. Compute proj b a, and proj b a. 

ii. Explain why proj b a can be considered a height of triangle PQR. 

iii. Find the area of the given triangle. 

3. Let u and v be vectors in M 5 with u • v = —1, |u| = 2, |v| = 3, and 9 the angle between u and v. 
Use the properties of the dot product to find each of the following. 

(a) u • 2v 

(b) (u + v) • v 

(c) (2u + 4v) • (u — 7v) 

(d) v • v 

(e) |u||v|cos(0) 

(f) 9 
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9.4 The Cross Product 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 


• How and when is the cross product of two vectors defined? 

• What geometric information does the cross product provide? 


The last two sections have introduced some basic algebraic operations on vectors—addition, 
scalar multiplication, and the dot product—with useful geometric interpretations. In this section, 
we will meet a final algebraic operation, the cross product, which again conveys important geomet¬ 
ric information. 

To begin, we must emphasize that the cross product is only defined for vectors u and v in 
M 3 . Also, remember that we use a right-hand coordinate system, as described in Section 9.1. In 
particular, recall that the vectors i, j, and k are oriented as shown below in Figure 9.43. Earlier, 
we noticed that if we point the index finger of our right hand in the direction of i and our middle 
finger in the direction of j, then our thumb points in the direction of k. 



Figure 9.43: Basis vectors i, j, and k. 

Preview Activity 9.4. The cross product of two vectors, u and v, will itself be a vector denoted 
u x v. The direction of u x v is determined by the right-hand rule: if we point the index finger of 
our right hand in the direction of u and our middle finger in the direction of v, then our thumb 
points in the direction of u x v. 

(a) We begin by defining the cross products using the vectors i, j, and k. Referring to Figure 
9.43, explain why i, j, k in that order form a right-hand system. We then define i x j to be 
k - that is i x j = k. 

(b) Now explain why i, k, and —j in that order form a right-hand system. We then define i x k 
to be —j - that is i x k = —j. 

(c) Continuing in this way, complete the missing entries in Table 9.4. 
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x j = k 

i x k = -j 

j X k 

x i = 

k x i = 

kxj 


Table 9.4: Table of cross products involving i, j, and k. 

(d) Up to this point, the products you have seen, such as the product of real numbers and the 
dot product of vectors, have been commutative, meaning that the product does not depend 
on the order of the terms. For instance, 2 • 5 = 5 • 2. The table above suggests, however, 
that the cross product is anti-commutative: for any vectors u and v in M 3 , u x v = — v x u. 

If we consider the case when u = v, this shows that vxv = —vxv. What does this tell 
us about v x v; in particular, what vector is unchanged by scalar multiplication by —1? 

(e) The cross product is also a bilinear operation, meaning that it interacts with scalar multi¬ 
plication and vector addition as one would expect: (cu + v) x w = c(u x w) + v x w. For 
example, 

(2i + j) x k = 2(i x k) + (j x k) = -2j + i. 

Using this property along with Table 9.4, find the cross product u x v if u = 2i + 3j and 
v = -i + k. 

(f) Verify that the cross product u x v you just found in part (e) is orthogonal to both u and v. 

(g) Consider the vectors u and v in the xy-plane as shown below in Figure 9.44. 



Figure 9.44: Two vectors in the xy-plane 

Explain why u = |u|i and v = |v| cos(0)i + |v| sin(0)j. Then compute the length of |u x v|. 

(h) Multiplication of real numbers is associative, which means, for instance, that (2 • 5) • 3 = 
2 • (5 • 3). Is it true that the cross product of vectors is associative? For instance, is it true 
that (i x j) x j = i x (j x j)? 


cxi 
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Computing the cross product 

As we have seen in Preview Activity 9.4, the cross product u x v is defined for two vectors u and 
v in M 3 and produces another vector in M 3 . Using the right-hand rule, we saw that 

i X j = k, i x k = -j, j x k = i 

j x i = —k kxi=j kxj = —i 


If in addition we apply the bilinearity property to compute the cross product in terms of the 
components of general vectors, u and v we see that 7 


U X V 


(uii + M 2 j + U 3 k) x (uii + M 2 j + M 3 k) 
u\i x (v\i + M 2 j + M 3 k) + 
u 2 j x (uii + v 2 j + M 3 k) + 

M 3 k x (i>ii + m 2 j + M 3 k) 

ui^ii x i + U1V21 x j + u\v 3 i X k + 

u 2 v ij x i + u 2 v 2 j x j + u 2 v 3 j x k + 

u 3 v ik X i + U3V 2k x j + U3V3 k x k 

uiv 2 k — U1V3} — u 2 v\ k + U2V31 + U3V ij — U3V21 

{U2V3 — U3V 2)i — (u\V3 — tt 3 fl)j + (u\V2 ~ U2V\)k 


To summarize, we have shown that 


(«ii + u 2 j + w 3 k) x (vii + V2I + v 3 k) = ( U2V3 - U3V2) i - (uiv 3 - u 3 vi)j + (uiv 2 - u 2 v 1 )k. ( 9 . 5 ) 


At first, this may look intimidating and difficult to remember. However, if we rewrite the 
expression in Equation (9.5) using determinants, important structure emerges. The determinant 
of a 2 X 2 matrix is 


It follows that we can thus rewrite Equation (9.5) in the form 


U 2 

U 3 


Ml 

u 3 

j + 

Ml 

U 2 

V 2 

V 3 

i — 

Ml 

M 3 

Ml 

M 2 


For those familiar with the determinant of a 3 x 3 matrix, we write the mnemonic 


U X V = 


i 

j 

k 

Ml 

u 2 

U3 

Ml 

V2 

V3 


7 Like the dot product, the cross product arises in physical applications, e.g., torque, but is it more convenient math¬ 
ematically to begin from an algebraic perspective. 
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Activity 9.16. 

Suppose u = (2, —1,0) and v = (0,1,3). Use the formula (9.5) for the following. 

(a) Find the cross product u x v. 

(b) Find the cross product u x i. 

(c) Find the cross product u x u. 

(d) Evaluate the dot products u • (u x v) and v • (u x v). What does this tell you about the 
geometric relationship among u, v, and u x v? 


<1 

The cross product satisfies the following properties, which were illustrated in Preview Activity 
9.4 and may be easily verified from the definition (9.5). 

Fet u, v, and w be vectors in M 3 , and let c be a scalar. Then 

1. u X V = —(v X u) 

2. (cu + v) X w = c(u X w) + V X w 

3. u x v = 0 if u and v are parallel. 

4. The cross product is not associative; that is, in general 

(u x v) X w / u x (v x w). 


Just as we found for the dot product, the cross product provides us with useful geometric infor¬ 
mation. In particular, both the length and direction of the cross product u x v encode information 
about the geometric relationship between u and v. 


The Length of u x v 

We may ask whether the length |u x v| has any relationship to the lengths of u and v. To investi¬ 
gate, we will compute the square of the length | u X v| 2 and denote by 6 the angle between u and 
v, as in Section 9.3. Doing so, we find through some significant algebra that 
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u x v| 2 


(u 2 V 3 - U3V2) 2 + (U1V3 - U 3 Vl) 2 + - U 2 V l) 2 

u 2 v 3 — 2 u 2 u 3 v 2 v 3 + tt 3 f 2 + ^1^3 — 2’Ui«3t'iU3 + u 3 v 2 + — 2miZ4 2 uit; 2 + u 2 V[ 

u\{vl + v 2 ) + u 2 2 (v\ + vl) + u 2 3 (vf + v 2 ) - 2 {uiu 2 viv 2 + uiu 3 viv 3 + u 2 u 3 v 2 v 3 ) 
u i( v i + v 2 + v 3 ) + u 2 (y\ + v 2 + u|) + + w 2 + u 3 ) — 

(’ u\v\ + U 2 vl + u\v 3 + 2 (ui'lt 2 ^lV 2 + U\U 3 V\V 3 + U 2 U 3 V 2 V 3 )) 

(uf + u\ + u\)(v\ + v\ + vl) - (uiVl + «2U 2 + Ms^) 2 

|u| 2 |v| 2 - (u • v) 2 

|u| 2 |v| 2 (l-cos 2 (0)) 

|u| 2 |v| 2 sm 2 (0). 


Therefore, we have found |u x v| 2 = |u| 2 |v| 2 sin 2 (0), which means that 

|u x v| = |u||v| sin(0). (9.6) 

Note that the third property stated above says that u x v = 0 if u and v are parallel. This is 
reflected in Equation (9.6) since sin(0) = 0 if u and v are parallel, which implies that u x v = 0. 

Equation (9.6) also has a geometric implication. Consider the parallelogram formed by two 
vectors u and v, as shown in Figure 9.45. 



Figure 9.45: The parallelogram formed by u and v 


Remember that the area of a parallelogram is the product of its base and height. As shown in 
the figure, we may consider the base of the parallelogram to be |u| and the height to be |v| sin(0). 
This means that the area of the parallelogram formed by u and v is 

|u||v| sin($) = |u x v|. 

This leads to the following interesting fact. 

The length, |u x v|, of the cross product of vectors u and v is the area of the parallelogram 
determined by u and v. 
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Activity 9.17. 

(a) Find the area of the parallelogram formed by the vectors u = (1,3, —2) and v = (3, 0,1). 

(b) Find the area of the parallelogram in M 3 whose vertices are (1,0,1), (0,0,1), (2,1,0), 
and (1,1,0). 

<1 


The Direction of u x v 

Now that we understand the length of u x v, we will investigate its direction. Remember from 
Preview Activity 9.4 that cross products involving the vectors i, j, and k resulted in vectors that 
are orthogonal to the two terms. We will see that this holds more generally. 

We begin by computing u • (u x v), and see that 

U • (u X v) = U\(U2V3 — U3V2) — U2(u\V3 — U3V1) + U3(u\V2 — U2Vl) 

= U1U2V3 — U1U3V2 — U2U1V3 + U2U3V1 + U3U1V2 ~ U3U2V1 

= 0 

To summarize, we have u • (u x v) = 0, which implies that u is orthogonal to u x v. In the same 

way, we can show that v is orthogonal to u x v. The net effect is that u x v is a vector that is 

perpendicular to both u and v, and hence u x v is perpendicular to the plane determined by u 
and v. Moreover, the direction of u x v is determined by applying the right-hand rule to u and v, 
as we saw in Preview Activity 9.4. In light of our earlier work that showed |u| | v| sin(0) = |u x v|., 
we may now express u x v in the following different way. 

Suppose that u and v are not parallel and that n is the unit vector perpendicular to the plane 
containing u and v determined by the right-hand rule. Then 

u x v = |u||v| sin(0) n. 


There is yet one more geometric implication we may draw from this result. Suppose u, v, and 
w are vectors in M 3 that are not coplanar and that form a three-dimension parallelepiped as shown 
in Figure 9.46. 

The volume of the parallelepiped is determined by multiplying A, the area of the base, by the 
height h. As we have just seen, the area of the base is |u x v|. Moreover, the height h = |w| cos (a) 
where a is the angle between w and the vector n, which is orthogonal to the plane formed by u 
and v. Since n is parallel to u x v, the angle between w and u x v is also a. This shows that 

|(u x v) • w| = |u x w||w| cos(a) = Ah, 
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Figure 9.46: The parallelepiped determined by u, v, and w 


and therefore 


The volume of the parallelepiped determined by u, v, and w is | (u x v) • w 


The quantity |(u x v) ■ w| is sometimes called the scalar triple product. 

Activity 9.18. 

Suppose u = (3, 5, —1) and v = (2, —2,1). 

(a) Find two unit vectors orthogonal to both u and v. 

(b) Find the volume of the parallelepiped formed by the vectors u, v, and w = (3, 3,1). 

(c) Find a vector orthogonal to the plane containing the points (0,1, 2), (4,1, 0), and (—2, 2, 2). 

(d) Given the vectors u and v shown below in Figure 9.47, sketch the cross products u x v 
and v x u. 



Figure 9.47: Vectors u and v 


(e) Do the vectors a = (1,3, —2), b = (2,1, —4), and c = (0,1,0) lie in the same plane? Use 
the concepts from this section to explain. 
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Figure 9.48: A force applied to a wrench 
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Torque is measured by a cross product 

We have seen that the cross product enables us to produce a vector perpendicular to two given 
vectors, to measure the area of a parallelogram, and to measure the volume of a parallelepiped. 
Besides these geometric applications, the cross product also enables us to describe a physical quan¬ 
tity called torque. 

Suppose that we would like to turn a bolt using a wrench as shown in Figure 9.48. If a force F 
is applied to the wrench and r is the vector from the position on the wrench at which the force is 
applied to center of the bolt, we define the torque, r, to be 


r = F x r. 


When a force is applied to an object, Newton's Second Faw tells us that the force is equal to 
the rate of change of the object's linear momentum. Similarly, the torque applied to an object is 
equal to the rate of change of the object's angular momentum. In other words, torque will cause 
the bolt to rotate. 

In many industrial applications, bolts are required to be tightened using a specified torque. 
Of course, the magnitude of the torque is |t| = |F x r| = |F||r|| sin(0). Thus, to produce a larger 
torque, we can increase either |F| or |r|, which you may know if you have ever removed lug nuts 
when changing a flat tire. The ancient Greek mathematician Archimedes said: "Give me a lever 
long enough and a fulcrum on which to place it, and I shall move the world." A modern spin on 
this statement is: "Allow me to make |r| large enough, and I shall produce a torque large enough 
to move the world." 


Comparing the dot and cross products 

Finally, it is worthwhile to compare and contrast the dot and cross products. 
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(a) u • v is a scalar, while u x v is a vector. 

(b) u • v = v • u, while u x v = — v x u 

(c) u • v = |u||v| cos (9), while |u x v| = |u||v| sin(0). 

(d) u ■ v = 0 if u and v are perpendicular, while u x v = 0 if u and v are parallel. 

Summary 

• The cross product is defined only for vectors in M 3 . The cross product of vectors u and vinK 3 
is the vector 

Oii + u 2 j + w 3 k) x (Tii + n 2 j + ^k) = {u 2 v 3 - u 3 v 2 )\ - ( uiv 3 - u 3 v i)j + ( uiv 2 - u 2 v i)k. 

• Geometrically, the cross product is 


u X v = |u| |v| sin(0) n, 

where 9 is the angle between u and v and n is a unit vector perpendicular to both u and v as 
determined by the right-hand rule. 

• The cross product of vectors u and v is a vector perpendicular to both u and v. 

• The magnitude |u x v| of the cross product of the vectors u and v gives the area of the parallel¬ 
ogram determined by u and v. Also, the scalar triple product |(u x v) • w| gives the volume of 
the parallelepiped determined by u, v, and w. 


Exercises 

1. Let u = 2i + j and v = i + 2j be vectors in M 3 . 

(a) Without doing any computations, find a unit vector that is orthogonal to both u and v. 
What does this tell you about the formula for u x v? 

(b) Using the bilinearity of the cross product and what you know about cross products 
involving the fundamental vectors i and j, compute u x v. 

(c) Next, use the determinant version of Equation (9.5) to compute u x v. Write one sentence 
that compares your results in (a), (b), and (c). 

(d) Find the area of the parallelogram determined by u and v. 

2. Let x = (1,1,1) and y = (0, 3, —2). 

(a) Are x and y orthogonal? Are x and y parallel? Clearly explain how you know, using 
appropriate vector products. 

(b) Find a unit vector that is orthogonal to both x and y. 
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(c) Express y as the sum of two vectors: one parallel to x, the other orthogonal to x. 

(d) Determine the area of the parallelogram formed by x and y. 

3. Consider the triangle in M 3 formed by P(3, 2, —1), Q( 1, —2,4), and R(4,4, 0). 

(a) Find Pc) and I’ll. 

(b) Observe that the area of A PQR is half of the area of the parallelogram formed by I’Q 
and I'll. Hence find the area of A PQR. 

(c) Find a unit vector that is orthogonal to the plane that contains points P, Q, and R. 

(d) Determine the measure of ZPQR. 
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9.5 Lines and Planes in Space 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the follozving important questions: 

• How are lines in M 3 similar to and different from lines in M 2 ? 

• What is the role that vectors play in representing equations of lines, particularly in M 3 ? 

• How can we think of a plane as a set of points determined by a point and a vector? 

• How do we find the equation of a plane through three given non-collinear points? 


Introduction 


In single variable calculus, we learn that a differentiable function is locally linear . In other words, 
if we zoom in on the graph of a differentiable function at a point, the graph looks like the tangent 
line to the function at that point. In multivariable calculus, we will soon study curves in space; 
differentiable curves turn out to be locally linear as well. In addition, as we study functions of two 
variables, we will see that such a function is locally linear at a point if the surface defined by the 
function looks like a plane (the tangent plane) as we zoom in on the graph. 

Consequently, it is important for us to understand both lines and planes in space. We begin 
our work by considering some familiar ideas in M 2 but from a new perspective. 

Preview Activity 9.5. We are familiar with equations of lines in the plane in the form y = mx + b, 
where m is the slope of the line and (0, b ) is the y-intercept. In this activity, we explore a more 
flexible way of representing lines that we can use not only in the plane, but in higher dimensions 
as well. 

To begin, consider the line through the point (2,-1) with slope |. 




y 
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(a) Suppose we increase x by 1 from the point (2, —1). How does the y-value change? What is 
the point on the line with x-coordinate 3? 
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(b) Suppose we decrease x by 3.25 from the point (2, —1). How does the y-value change? What 
is the point on the line with x-coordinate —1.25? 

(c) Now, suppose we increase x by some arbitrary value 3 1 from the point (2, —1). How does 
the y-value change? What is the point on the line with x-coordinate 2 + 3 tl 

(d) Observe that the slope of the line is related to any vector whose y-component divided by 
the x-component is the slope of the line. For the line in this exercise, we might use the 
vector (3, 2), which describes the direction of the line. Explain why the terminal points of 
the vectors r(f), where 

r (t) = <2, —1) + <3, 2)t, 

trace out the graph of the line through the point (2,-1) with slope 

(e) Now we extend this vector approach to M 3 and consider a second example. Let C be the 
line in M 3 through the point (1,0, 2) in the direction of the vector (2, —1,4). 

Find the coordinates of three distinct points on line C. Explain your thinking. 

(f) Find a vector in form 

r (t) = (x 0 ,yo,z 0 ) + (a, b, c)t 

whose terminal points trace out the line £ that is described in (e). That is, you should be 
able to locate any point on the line by determining a corresponding value of t. 




Lines in Space 

In two-dimensional space, a non-vertical line is defined to be the set of points satisfying the equa¬ 
tion 

y = mx + b, 

for some constants m and b. The value of m (the slope) tells us how the dependent variable 
changes for every one unit increase in the independent variable, while the point (0, b) is the y- 
intercept and anchors the line to a location on the y-axis. Alternatively, we can think of the slope 
as being related to the vector (1, m), which tells us the direction of the line, as shown on the left in 
Figure 9.49. Thus, we can identify a line in space by fixing a point P and a direction v, as shown 
on the right. 


Definition 9.7. A line in space is the set of terminal points of vectors emanating from a given 
point P that are parallel to a fixed vector v. 


The fixed vector v in the definition is called a direction vector for the line. As we saw in Preview 
Activity 9.5, to find an equation for a line through point P in the direction of vector v, observe that 
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any vector parallel to v will have the form tv for some scalar t. So, any vector emanating from the 
point P in a direction parallel to the vector v will be of the form 

of> + vt (9.7) 


for some scalar t (where O is the origin). 





Figure 9.50: A line in 2-space. 


Figure 9.50 presents three images of a line in two-space in which we can identify the vector 
Op and the vector tv as in Equation (9.7). Here, OP is the fixed vector shown in blue, while 
the direction vector v is the vector parallel to the vector shown in green (that is, the green vector 
represents tv, and the line is traced out by the terminal points of the magenta vector). In other 
words, the tips (terminal points) of the magenta vectors (the vectors of the form Op + tv) trace 
out the line as t changes. 

In particular, the terminal points of the vectors of the form in (9.7) define a linear function r in 
space of the following form, which is valid in any dimension. 


The vector form of a line through the point P in the direction of the vector v is 

r(t) = r 0 + tv, (9.8) 

where ro is the vector OP from the origin to the point P. 
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Of course, it is common to represent lines in the plane using the slope-intercept equation y = 
mx + b. The vector form of the line, described above, is an alternative way to represent lines that 
has the following two advantages. First, in two dimensions, we are able to represent vertical lines, 
whose slope m is not defined, using a vertical direction vector, such as v = (0,1). Second, this 
description of lines works in any dimension even though there is no concept of the slope of a line 
in more than two dimensions. 



Figure 9.51: A line in 3-space. 


Activity 9.19. 

Let P\ = (1, 2, —1) and P -2 = (—2,1, —2). Let C be the line in M 3 through P\ and P 2 , and note 
that three snapshots of this line are shown in Figure 9.51. 

(a) Find a direction vector for the line C. 

(b) Find a vector equation of C in the form r(i) = ro + tv. 

(c) Consider the vector equation s (t) = (—5,0, —3) + t( 6, 2, 2). What is the direction of the 
line given by s(i)? Is this new line parallel to line CP. 

(d) Do r (t) and s (t) represent the same line, £? Explain. 
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The Parametric Equations of a Line 

The vector form of a line, r(t) = ro + tv in Equation (9.8), describes a line as the set of terminal 
points of the vectors r (t). If we write this in terms of components letting 

r (t) = (x(t),y(t),z(t)), r 0 = (x 0 ,yo,zo), and v = (a,b,c), 

then we can equate the components on both sides of r(t) = ro + tv to obtain the equations 

x(t) = xq + at, y(t ) = yo + bt, and z(t) = zq + ct, 












54 


9.5. LINES AND PLANES IN SPACE 


which describe the coordinates of the points on the line. The variable t represents an arbitrary 
scalar and is called a parameter. In particular, we use the following language. 

The parametric equations for a line through the point P = (xq, ijq . zq) in the direction of the 
vector v = (a, b , c) are 

x(t) = x 0 + at, y(t ) = yo + bt, z(t ) = z 0 + ct. 


Notice that there are many different parametric equations for the same line. For example, 
choosing another point P on the line or another direction vector v produces another set of para¬ 
metric equations. It is sometimes useful to think of t as a time parameter and the parametric 
equations as telling us where we are on the line at each time. In this way, the parametric equations 
describe a particular walk taken along the line; there are, of course, many possible ways to walk 
along a line. 

Activity 9.20. 

Let P\ = (1, 2, —1) and P 2 = (—2,1, —2), and let £ be the line in M 3 through Pi and Pi, which is 
the same line as in Activity 9.19. 

(a) Find parametric equations of the line £. 

(b) Does the point (1, 2,1) lie on £1 If so, what value of t results in this point? 

(c) Consider another line, AC, whose parametric equations are 

x(s) = —2 + 4s, y(s) = 1 — 3s, z(s) = —2 + 2s. 

What is the direction of line AT? 

(d) Do lines £ and AC intersect? If so, provide the point of intersection and the t and s values, 
respectively, that result in the point. If not, explain why. 
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Planes in Space 

Now that we have a way of describing lines, we would like to develop a means of describing 
planes in three dimensions. We studied the coordinate planes and planes parallel to them in 
Section 9.1. Each of those planes had one of the variables x, y, or z equal to a constant. We can 
note that any vector in a plane with x constant is orthogonal to the vector (1, 0, 0), any vector in a 
plane with y constant is orthogonal to the vector (0,1,0), and any vector in a plane with 0 constant 
is orthogonal to the vector (0, 0,1). This idea works in general to define a plane. 


Definition 9.8. A plane p in space is the set of all terminal points of vectors emanating from a 
given point Pq perpendicular to a fixed vector n, as shown in Figure 9.52. 


@©@© 












9.5. LINES AND PLANES IN SPACE 


55 


■ = (a, b, c) 



Figure 9.52: A point Pq on a plane p with a normal vector n 


The definition allows us to find the equation of a plane. Assume that n = (a, b, c), Pq = 
(xo, yo, zo), and that P = (x. y, z ) is an arbitrary point on the plane. Since the vector PPq lies in the 
plane, it must be perpendicular to n. This means that 

0 = n • PPq 

= n • ((x,y,z) - (x 0 ,yo,z 0 )) 

= n • (x - x 0 ,y - yo,z - z 0 ) 

= a(x - x 0 ) + b(y - y 0 ) + c{z - z 0 ). 


The fixed vector n perpendicular to the plane is frequently called a normal vector to the plane. 
We may now summarize as follows. 

The scalar equation of the plane with normal vector n = (a, 6 , c) containing the point Po = 

(x 0 ,y 0 ,z 0 ) is 

a(x - x 0 ) + b(y - y 0 ) + c(z - z 0 ) = 0. (9.9) 


We may take this a little further and note that since 

a(x - x 0 ) + b(y - y 0 ) + c(z - z 0 ) = 0, 

it equivalently follows that 

ax + by + cz = ax o + byo + czq. 

That is, we may write the equation of a plane as ax + by + cz = d where where d = n ■ (xq, yo, z'o). 

For instance, if we would like to describe the plane passing through the point Po = (4, —2,1) 
and perpendicular to the vector n = (1,2,1), we have 


(1) 2,1) ■ (x, y, z) = <1,2,1) • (4, —2,1) 
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or 

x + 2 y + z = 1. 

Notice that the coefficients of x, y, and z in this description give a vector perpendicular to the 
plane. For instance, if we are presented with the plane 

— 2x + y — 3z = 4, 

we know that n = (—2,1, —3) is a vector perpendicular to the plane. 

Activity 9.21. 

(a) Write the equation of the plane p\ passing through the point (0,2,4) and perpendicular 
to the vector n = (2, —1,1). 

(b) Is the point (2, 0, 2) on the plane pi? 

(c) Write the equation of the plane p 2 that is parallel to p\ and passing through the point 

(3,0,4). 

(d) Write the parametric description of the line l passing through the point (2,0,2) and 
perpendicular to the plane ps described the equation x + 2y — 2z = 7. 

(e) Find the point at which l intersects the plane P 3 . 

<1 

Just as two distinct points in space determine a line, three non-collinear points in space de¬ 
termine a plane. Consider three points Pq, Pi, and P 2 in space, not all lying on the same line as 
shown in Figure 9.53. 



Figure 9.53: A plane determined by three points Pq, Pi, and P 2 

Observe that the vectors PqP[ and PqF 2 both lie in the plane p. If we form their cross-product 

n = P 0 Pi x P 0 P 2 , 

we obtain a normal vector to the plane p. Therefore, if P is any other point on p, it then follows 
that PqP will be perpendicular to n, and we have, as before, the equation 

n • = 0. (9.10) 
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Activity 9.22. 

Let Pq = (1, 2, —1), Pi = (1, 0, —1), and P 2 = (0,1, 3) and let p be the plane containing Pq, Pi, 
and P 2 . 

(a) Determine the components of the vectors PqP 1 and P 0 P 2 . 

(b) Find a normal vector n to the plane p. 

(c) Find the scalar equation of the plane p. 

(d) Consider a second plane, q, whose scalar equation is — 3(x — 1 ) + 4 (y + 3) + 2(z — 5) = 0. 
Find two different points on plane q, as well as a vector m that is normal to q. 

(e) We define the angle between two planes to be the angle between their respective normal 
vectors. What is the angle between planes p and ql 

<1 


Summary 

• While lines in M 3 do not have a slope, like lines in M 2 they can be characterized by a point and 
a direction vector. Indeed, we define a line in space to be the set of terminal points of vectors 
emanating from a given point that are parallel to a fixed vector. 

• Vectors play a critical role in representing the equation of a line. In particular, the terminal 
points of the vector r (t) = ro + fv define a linear function r in space through the terminal point 
of the vector ro in the direction of the vector v, tracing out a line in space. 

• A plane in space is the set of all terminal points of vectors emanating from a given point per¬ 
pendicular to a fixed vector. 

• If P\, P 2 , and / ;<, are non-cobin ear p oints in space, the vectors P\ P 2 and and P1P3 are vectors in 
the plane and the vector n = P1P2 x P\ P> is a normal vector to the plane. So any point P in 
the plane satisfies the equation PP\ • n = 0. If we let P = (x, y. z), n = (a, b, c ) be the normal 
vector, and Pi = (xo, Vo■ zq), we can also represent the plane with the equation 

a(x - x 0 ) + b(y - y 0 ) + c(z - z 0 ) = 0. 


Exercises 

1. The vector and parametric forms of a line allow us to easily describe line segments in space. 

Let Pi = (1, 2, —1) and P 2 = (—2,1, —2), and let C be the line in M 3 through Pi and P 2 as in 
Activity 9.19. 

(a) What value of the parameter t makes (x(t),y(t), z(t)) = Pi? What value of t makes 

(x (t),y(t),z(t)) = P 2 ? 

(b) What restrictions on the parameter t describe the line segment between the points Pi 
and P 2 ? 
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(c) What about the line segment (along the same line) from (7,4,1) to (—8, —1, —4)? 

(d) Now, consider a segment that lies on a different line: parameterize the segment that 
connects point R = (4, —2, 7) to Q = (—11,4, 27) in such a way that t = 0 corresponds 
to point Q, while t = 2 corresponds to R. 

2. This exercise explores key relationships between a pair of lines. Consider the following two 
lines: one with parametric equations x(s) =4 — 2 s, y(s ) = —2 + s, z(s) = 1 + 3s, and the other 
being the line through (—4, 2,17) in the direction v = (—2,1, 5). 

(a) Find a direction vector for the first line, which is given in parametric form. 

(b) Find parametric equations for the second line, written in terms of the parameter t. 

(c) Show that the two lines intersect at a single point by finding the values of s and t that 
result in the same point. 

(d) Find the angle formed where the two lines intersect, noting that this angle will be given 
by the angle between their respective direction vectors. 

(e) Find an equation for the plane that contains both of the lines described in this problem. 

3. This exercise explores key relationships between a pair of planes. Consider the following two 
planes: one with scalar equation 4x — by + z = —2, and the other which passes through the 
points (1,1,1), (0,1, —1), and (4, 2, —1). 

(a) Find a vector normal to the first plane. 

(b) Find the scalar equation for the second plane. 

(c) Find the angle between the planes, where the angle between them is defined by the 
angle between their respective normal vectors. 

(d) Find a point that lies on both planes. 

(e) Since these two planes do not have parallel normal vectors, the planes must intersect, 
and thus must intersect in a line. Observe that the line of intersection lies in both planes, 
and thus the direction vector of the line must be perpendicular to each of the respective 
normal vectors of the two planes. Find a direction vector for the line of intersection for 
the two planes. 

(f) Determine parametric equations for the line of intersection of the two planes. 

4. In this problem, we explore how we can use what we know about vectors and projections to 
find the distance from a point to a plane. 

Let p be the plane with equation z = —4x + 3y + 4, and let Q = (4, — 1, 8). 

(a) Show that Q does not lie in the plane p. 

(b) Find a normal vector n to the plane p. 
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(c) Find the coordinates of a point P in p. 

(d) Find the components of P$. Draw a picture to illustrate the objects found so far. 

(e) Explain why |comp n /-Y)| gives the distance from the point Q to the plane p. Find this 
distance. 
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9.6 Vector-Valued Functions 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 


• What is a vector-valued function? What do we mean by the graph of a vector-valued func¬ 
tion? 

• What is a parameterization of a curve in M 2 ? In R 3 ? What can the parameterization of a 
curve can tell us? 


Introduction 

So far, we have seen several different examples of curves in space, including traces and contours 
of functions of two variables, as well as lines in 3-space. Recall that for a line through a fixed point 
ro in the direction of vector v, we may express the line parametrically through the single vector 
equation 

r (t) = r 0 + tv. 

From this perspective, the vector r(t) is a function that depends on the parameter t, and the ter¬ 
minal points of this vector trace out the line in space. 

Like lines, other curves in space are one-dimensional objects, and thus we aspire to similarly 
express the coordinates of points on a given curve in terms of a single variable. Vectors are a 
perfect vehicle for doing so - we can use vectors based at the origin to identify points in space, 
and connect the terminal points of these vectors to draw a curve in space. This approach will allow 
us to draw an incredible variety of graphs in 2- and 3-space, as well as to identify and describe 
curves in n-space for any n. 

Preview Activity 9.6. In this activity we consider how we might use vectors to define a curve in 
space. 

(a) On a single set of axes in M 2 , draw the vectors (cos(0), sin(0)), (cos (|) , sin (|)), 

(cos ( 7 r) , sin (7r)), and (cos (4f) , sin (4f)) with their initial points at the origin. 

(b) On the same set of axes, draw the vectors (cos ((f) , sin (|)), (cos (if) , sin (if-)), 

(cos (if 1 ) , sin (^f)), and (cos (if 1 ) , sin (-(f)) with their initial points at the origin. 

(c) Based on the pictures from parts (a) and (b), sketch the set of terminal points of all of 
the vectors of the form (cos(t), sin(f)), where t assumes values from 0 to 27 t. What is the 
resulting figure? Why? 
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Vector-Valued Functions 

Consider the curve shown in Figure 9.54. As in Preview Activity 9.6, we can think of a point on 
this curve as resulting from a vector from the origin to the point. As the point travels along the 
curve, the vector changes in order to terminate at the desired point. A few still pictures of this 
motion are shown in Figure 9.54. 





Figure 9.54: The graph of a curve in space. 


Thus, we can think of the curve as a collection of terminal points of vectors emanating from 
the origin. We therefore view a point traveling along this curve as a function of time t, and define 
a function r whose input is the variable t and whose output is the vector from the origin to the 
point on the curve at time t. In so doing, we have introduced a new type of function, one whose 
input is a scalar and whose output is a vector. 

The terminal points of the vector outputs of r then trace out the curve in space. From this 
perspective, the x, y, and z coordinates of the point are functions of time, t, say 


x = x(t), y = y(t), and z = z(t), 


and thus we have three coordinate functions that enable us to represent the curve. The variable t is 
called a parameter and the equations x = x(t), y = y[t), and z = z(t) are called parametric equations 
(or a parameterization of the curve). The function r whose output is the vector from the origin to a 
point on the curve is defined by 


r(i) = (x(t),y(t),z(t)). 


Note that the input of r is the real-valued parameter t and the corresponding output is vector 
(x(t),y(t),z(t)). Such a function is called a vector-valued function because each real number input 
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generates a vector output. More formally, we state the following definition. 

Definition 9.9. A vector-valued function is a function whose input is a real parameter t and 
whose output is a vector that depends on t. The graph of a vector-valued function is the set of 
all terminal points of the output vectors with their initial points at the origin. 

Parametric equations for a curve are equations of the form 

x = x(t), y = y(t), and z = z(t) 

that describe the (x, y, z ) coordinates of a point on a curve in M 3 . 


Note particularly that every set of parametric equations determines a vector-valued function 
of the form 

r (t) = (x(t),y(t),z(t)), 

and every vector-valued function defines a set of parametric equations for a curve. Moreover, we 
can consider vector-valued functions and parameterizations in M 2 , M 4 , or indeed a real space of 
any dimension. As a reminder, in Section 9.5, we determined the parametric equations of a line in 
space using a point and a direction vector. For a nonlinear example, the curve in Figure 9.54 has 
the parametric equations 

x(t) = cos(f), y(t) = sin(f), and z(t) = cos (t) sin (t). 

Represented as a vector-valued function r, the curve in Figure 9.54 is the graph of 

r(f) = (cos(f), sin(f), cos(f) sin(f)). 


Activity 9.23. 

The same curve can be represented with different parameterizations. Use your calculator, 8 
Wolfram | Alpha, or some other graphing device 9 to plot the curves generated by the following 
vector-valued functions. Compare and contrast the graphs - explain how they are alike and 
how they are different. 

(a) r(t) = (sin(f), cos(t)) 

(b) r(t) = (sin(2f), cos(2f)) 

(c) r(f) = (cos(f + 7 r), sin (t + ir)) 


<1 

8 If you have a graphing calculator you can draw graphs of vector-valued functions in R 2 using the parametric mode 
(often found in the MODE menu). 

9 e.g., http : //webspace . ship . edu/msrenault/ggb/parametric_grapher . html 
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The examples in Activity 9.23 illustrate that a parameterization allows us to look not only at 
the graph, but at the direction and speed at which the graph is traversed as t changes. In the 
different parameterizations of the circle, we see that we can start at different points and move 
around the circle in either direction. The calculus of vector-valued functions - which we will 
begin to investigate in Section 9.7 - will enable us to precisely quantify the direction, speed, and 
acceleration of a particle moving along a curve in space. As such, describing curves parametrically 
will allow us to not only indicate the curve itself, but also to describe how motion occurs along 
the curve. 

Using parametric equations to define vector-valued functions in two dimensions is much more 
versatile than just defining y as a function of x. In fact, if y = fix) is a function of x, then we can 
parameterize the graph of f by 

r(t) = (t,f(t)), 

and thus every single-variable function may be described parametrically. In addition, as we saw 
in Preview Activity 9.6 and Activity 9.23, we can use vector-valued functions to represent curves 
in the plane that do not define y as a function of x (or x as a function of y). 10 

Activity 9.24. 

Vector-valued functions can be used to generate many interesting curves. Graph each of the 
following using an appropriate tool 11 , and then write one sentence for each function to describe 
the behavior of the resulting curve. 

(a) r(t) = (t cos(f),tsin(f)) 

(b) r (t) = (sin(t) cos(t), tsin(t)) 

(c) r (t) = (t 2 sin(i) cos(f), 0.9t cos(t 2 ), sin(t)) 

(d) r(t) = (sin(5f), sin(4i)) 

(e) Experiment with different formulas for x(t) and y(t) and ranges for t to see what other 
interesting curves you can generate. Share your best results with peers. 
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Recall from our earlier work that the traces and level curves of a function are themselves curves 
in space. Thus, we may determine parameterizations for them. For example, if z = fix. y) = 
cos(x 2 + y 2 ), the y = 1 trace of the function is given by setting y = 1 and letting x be parameterized 
by the variable t; then, the trace is the curve whose parameterization is (t. 1, cos(f 2 + 1)). 

Activity 9.25. 

Consider the paraboloid defined by f(x, y) = x 2 + y 2 . 

10 As an aside, vector-valued functions make it easy to plot the inverse of a one-to-one function in two dimensions. 
To see how, if y = f(x) defines a one-to-one function, then we can parameterize this function by r(t) = (t, Since 
the inverse function just reverses the role of input and output, a parameterization for / _1 is (/(£) , t). 

n e.g., the 2D grapher at http://webspace.ship.edu/msrenault/ggb/parametric_grapher.html, or 
for 3D graphs Wolfram|Alpha, an on-line 3D grapher like http://www.math.uri.edu/~bkaskosz/flashmo/ 
parcur/, or some other device 
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(a) Find a parameterization for the x = 2 trace of /. What type of curve does this trace 
describe? 

(b) Find a parameterization for the y = — 1 trace of /. What type of curve does this trace 
describe? 

(c) Find a parameterization for the level curve fix, y ) = 25. What type of curve does this 
trace describe? 

(d) How do your responses change to all three of the preceding question if you instead 
consider the function g defined by g{x , y) = x 2 — y 2 ? (Hint for generating one of the 
parameterizations: sec 2 (t) — tan 2 (f) = 1.) 
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Summary 

• A vector-valued function is a function whose input is a real parameter t and whose output is a 
vector that depends on t . The graph of a vector-valued function is the set of all terminal points 
of the output vectors with their initial points at the origin. 

• Every vector-valued function provides a parameterization of a curve. In M 2 , a parameterization 
of a curve is a pair of equations x = x(t) and y = y{t) that describes the coordinates of a point 
(x, y) on the curve in terms of a parameter t. In M 3 , a parameterization of a curve is a set of 
three equations x = x(t), y = y(t), and z = z(t) that describes the coordinates of a point (x, y, z) 
on the curve in terms of a parameter t. 


Exercises 

1. The standard parameterization for the unit circle is (cos(f), sin(i)), for 0 < t < 27r. 

(a) Find a vector-valued function r that describes a point traveling along the unit circle so 

that at time t = 0 the point is at and travels clockwise along the circle as t 

increases. 

(b) Find a vector-valued function r that describes a point traveling along the unit circle so 
that at time t = 0 the point is at (yyr- j and travels counter-clockwise along the circle 
as t increases. 

(c) Find a vector-valued function r that describes a point traveling along the unit circle so 

that at time t = 0 the point is at j and travels clockwise along the circle as t 

increases. 

2. Let a and b be positive real numbers. You have probably seen the equation + (u j = 1 

that generates an ellipse, centered at ( h, k), with a horizontal axis of length 2 a and a vertical 
axis of length 2 b. 
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(a) Explain why the vector function r defined by r (t) = (a cos(f), b sin(t)), 0 < t < 2 tt is one 

2 2 

parameterization of the ellipse ^ + p- = 1. 

2 2 

(b) Find a parameterization of the ellipse ^ ^ = 1 that is traversed counterclockwise. 

(c) Find a parameterization of the ellipse tUAL _|_ = i. 

(d) Determine the x-y equation of the ellipse that is parameterized by 

r(t) = (3 + 4sin(2f), 1 + 3cos(2t)). 

3. Consider the two-variable function z = f(x, y) = 3.x 2 + Ay 2 — 2. 

(a) Determine a vector-valued function r that parameterizes the curve which is the x = 2 
trace of z = f(x, y). Plot the resulting curve. Do likewise for x = —2, —1, 0, and 1. 

(b) Determine a vector-valued function r that parameterizes the curve which is the y = 2 
trace of z = f(x, y). Plot the resulting curve. Do likewise for y = —2, —1, 0, and 1. 

(c) Determine a vector-valued function r that parameterizes the curve which is the z = 2 
contour of z = f(x , y). Plot the resulting curve. Do likewise for z = —2, —1, 0, and 1. 

(d) Use the traces and contours you've just investigated to create a wireframe plot of the 
surface generated by z = f(x,y). In addition, write two sentences to describe the char¬ 
acteristics of the surface. 

4. Recall that any line in space may be represented parametrically by a vector-valued function. 

(a) Find a vector-valued function r that parameterizes the line through (—2,1,4) in the 
direction of the vector v = (3, 2, —5). 

(b) Find a vector-valued function r that parameterizes the line of intersection of the planes 

x + 2y — z = 4 and 3x + y — 2z = 1. 

(c) Determine the point of intersection of the lines given by 

x = 2 + 3t, y = 1 — 2t, z = 4f, 
x = 3 + Is, y = 3 — 2s, z = 2s. 

Then, find a vector-valued function r that parameterizes the line that passes through 
the point of intersection you just found and is perpendicular to both of the given lines. 
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9.7 Derivatives and Integrals of Vector-Valued Functions 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• What do we mean by the derivative of a vector-valued function and how do we calculate 
it? 

• What does the derivative of a vector-valued function measure? 

• What do we mean by the integral of a vector-valued function and how do we compute it? 

• How do we describe the motion of a projectile if the only force acting on the object is accel¬ 
eration due to gravity? 


Introduction 


A vector-valued function r determines a curve in space as the collection of terminal points of the 
vectors r(t). If the curve is smooth, it is natural to ask whether r(t) has a derivative. In the same 
way, our experiences with integrals in single-variable calculus prompt us to wonder what the 
integral of a vector-valued function might be and what it might tell us. We explore both of these 
questions in detail in this section. 

For now, let's recall some important ideas from calculus I. Given a function s that measures 
the position of an object moving along an axis, its derivative, s', is defined by 


s'(t) = lim 
h-+ o 


s(t + h) — s(t ) 
h : 


and measures the instantaneous rate of change of s with respect to time. In particular, for a fixed 
value t = a, s'(a) measures the velocity of the moving object, as well as the slope of the tangent 
line to the curve y = s(t) at the point (a, s(a)). 

As we work with vector-valued functions, we will strive to update these ideas and perspectives 
into the context of curves in space and outputs that are vectors. 

Preview Activity 9.7. Let r(f) = cos(f)i + sin(2f)j describe the path traveled by an object at time t. 


(a) Use appropriate technology to help you sketch the graph of the vector-valued function r, 
and then locate and label the point on the graph when t = tt. 

(b) Recall that for functions of a single variable, the derivative of a sum is the sum of the 
derivatives; that is, ^[/(x) + g(x)} = f'(x) + g'(x). With this idea in mind and viewing i 
and j as constant vectors, what do you expect the derivative of r to be? Write a proposed 
formula for r '(t). 
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(c) Use your result from part (b) to compute r'(7r). Sketch this vector r'(7r) as emanating from 
the point on the graph of r when t = ir , and explain what you think r'(7r) tells us about 
the object's motion. 


CXI 


The Derivative 


In single variable calculus, we define the derivative, f, of a given function / by 


/'(*) 


lim f( x + h ) ~ f ( x ) 

h —>0 h 


provided the limit exists. At a given value of a, f(a) measures the instantaneous rate of change 
of /, and also tells us the slope of the tangent line to the curve y = f(x) at the point (a, f(a)). The 
definition of the derivative extends naturally to vector-valued functions and curves in space. 


Definition 9.10. The derivative of a vector-valued function r is defined to be 

r(t + h) — r(t) 


r'(t) = lim 

Zl-5-0 


h 


for those values of t at which the limit exists. We also use the notation ^ and ^[r(t)] for r'(t). 


Activity 9.26. 

Let's investigate how we can interpret the derivative r'(f). Let r be the vector-valued function 
whose graph is shown in Figure 9.55, and let h be a scalar that represents a small change in 
time. The vector r(t) is the blue vector in Figure 9.55 and r(t, + h) is the green vector. 



Figure 9.55: A single difference quotient. 

(a) Is the quantity r(t + h) — r(t) a vector or a scalar? Identify this object in Figure 9.55. 
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(b) Is r ^ +/ 'fr a vector or a scalar? Sketch a representative vector r (*+U W) / t < i i n 
Figure 9.55. 

(c) Think of r(t) as providing the position of an object moving along the curve these vectors 

trace out. What do you think that the vector measures? Why? (Hint: You 

might think analogously about difference quotients such as or 

from calculus I.) 

(d) Figure 9.56 presents three snapshots of the vectors r ( t+h J~ r ( t '> as we ] e t h — > 0. Write 2-3 
sentences to describe key attributes of the vector 

r(i + h) — r(t) 

lim---. 

h — h 

(Hint: Compare to limits such as lim^o ^ x+h ^^ x ' > or lining from calculus 

I, keeping in mind that in three dimensions there is no general concept of slope.) 





Figure 9.56: Snapshots of several difference quotients. 
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As Activity 9.26 indicates, if r (t) determines the position of an object at time t, then r ( t + h j^ r( - t ') 
represents the average rate of change in the position of the object over the interval [t, t + h\, which 
is also the average velocity of the object on this interval. Thus, the derivative 


r '(t) = lim 

/i-MD 


r(t + h) 
h 


r (t) 


is the instantaneous rate of change of r (t) at time t (for those values of t for which the limit exists), 
so r'(t) = v(t) is the instantaneous velocity of the object at time t. Furthermore, we can interpret 
the derivative r' (t) as the direction vector of the line tangent to the graph of r at the value t. 


Similarly, 


v'(t) 


r "(t) = lim 
h-^ o 


v(t + h) 

h 


v(f) 


is the instantaneous rate of change of the velocity of the object at time t, for those values of t for 
which the limits exists, and thus v'(f) = a(t) is the acceleration of the moving object. 
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Note well: Both the velocity and acceleration are vector quantities: they have magnitude and 
direction. By contrast, the magnitude of the velocity vector, |v(f)|, which is the speed of the object 
at time t, is a scalar quantity. 


Computing Derivatives 


As we learned in single variable calculus, computing derivatives from the definition is often dif¬ 
ficult. Fortunately, properties of the limit make it straightforward to calculate the derivative of a 
vector-valued function similar to how we developed shortcut differentiation rules in calculus I. 
To see why, recall that the limit of a sum is the sum of the limits, and that we can remove con¬ 
stant factors from limits. Thus, as we observed in a particular example in Preview Activity 9.7, if 
r (t) = x{t) i + y(t) j + z(t) k, it follows that 


r'(t) = lim 
ft- 5-0 

= lim 

ft-*- o 


r(t + h ) — r(i) 
h 

[x(t + h)~ x(t)]i + [y(t + h) - y(t)\ j + [z(t + h) - z(i)]k 


h 


= (lim AtUA] i + ( lim vV + y-vm j + ( Hm 
\h— *0 h J \h— *0 h J yft—* o 

= x'(t)i + y'(t)i + z'(t)k. 


z{t + h) — z(t) 
h 


Thus, we can calculate the derivative of a vector-valued function by simply differentiating its 
components. 


If r(i) = x(t)i + y{t)} + z{t) k, then 

^r(f) = x'{t) i + y'{t) j + z'(t) k 

for those values of t at which x, y, and 2 are differentiable. 


Activity 9.27. 

For each of the following vector-valued functions, find r'(f). 

(a) r(t) = (cos(f), t sin(t), ln(t)). 

(b) r (t) = ( t 2 + 3 1, e ~ 2t , ^j). 

(c) r(t) = (tan(t), cos(f 2 ), te^*). 

(d) r(t) = (Vt 4 + 4, sin(3f), cos(4t)). 
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In first-semester calculus, we developed several important differentiation rules, including the 
constant multiple, product, quotient, and chain rules. For instance, recall that we formally state 
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the product rule as 

^[/(x) • g(x)\ = f(x) ■ g\x ) + g(x) ■ f'(x). 

There are several analogous rules for vector-valued functions, including a product rule for scalar 
functions and vector-valued functions. These rules, which are easily verified, are summarized in 
the following theorem. 

Theorem. Let / be a differentiable real-valued function of a real variable t and let r and s be 
differentiable vector-valued functions of the real variable t. Then 

1- ^[r(0 + s (0] =r'(f) + s'(f) 

2 - ^[/(*M*)] = f(ty(t) + f(t)r(t) 

3 - j f [r(i) • s (t)] = r'(f) • s(t) + r(t) ■ s'(f) 

4. [r(t) x s(t)] = r'(f) x s(f) + r (t) x s'(t) 

5 ‘ Jt = /(^^(/(O)- 


Note well. When applying these properties, use care to interpret the quantities involved as 
either scalars or vectors. For example, r(i) • s(t) defines a scalar function because we have taken 
the dot product of two vector-valued functions. However, r(t) x s(t) defines a vector-valued 
function since we have taken the cross product of two vector-valued functions. 

Activity 9.28. 

The left side of Figure 9.57 shows the curve described by the vector-valued function r defined 
by 

r(t) = ^2 1- W 2 + M- 1^ • 


(a) Find the object's velocity v(t). 

(b) Find the object's acceleration a (t). 

(c) Indicate on the left of Figure 9.57 the object's position, velocity and acceleration at the 
times t = 0,2,4. Draw the velocity and acceleration vectors with their tails placed at the 
object's position. 

(d) Recall that the speed is |v| = v'v • v. Find the object's speed and graph it as a function 
of time t on the right of Figure 9.57. When is the object's speed the slowest? When is 
the speed increasing? When it is decreasing? 
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Figure 9.57: The curve r(t) = (2 1 — \t 2 + 1, t — l) and its speed. 


(e) What seems to be true about the angle between v and a when the speed is at a min¬ 
imum? What is the angle between v and a when the speed is increasing? when the 
speed is decreasing? 

(f) Since the square root is an increasing function, we see that the speed increases precisely 
when v • v is increasing. Use the product rule for the dot product to express ^(v • v) 
in terms of the velocity v and acceleration a. Use this to explain why the speed is 
increasing when v • a > 0 and decreasing when v • a < 0. Compare this to part (d). 

(g) Show that the speed's rate of change is 

^|v(t)| = comp v a. 
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Tangent Lines 

One of the most important ideas in first-semester calculus is that a differentiable function is locally 
linear: that is, when viewed up close, the curve generated by a differentiable function looks very 
much like a line. Indeed, when we zoom in sufficiently far on a particular point, the curve looks 
indistinguishable from its tangent line. 

In the same way, we expect that a smooth curve in 3-space will be locally linear. In the follow¬ 
ing activity, we investigate how to find the tangent line to such a curve. Recall from our work in 
Section 9.5 that the vector equation of a line that passes through the point at the tip of the vector 
Lo = (xo, 2/0) z o) i n the direction of the vector u = (a, b. c ) can be written as 

L(t) = Lo + tu. 

In parametric form, the line L is given by 

x(t) = xq + at , y(t) = yo + bt, z(t) = z 0 + ct. 
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Activity 9.29. 

Let 


r(t) = cos(f)i — sin(t)j + fk. 12 


(a) Determine the coordinates of the point on the curve traced out by r (t) when t = tt. 

(b) Find a direction vector for the line tangent to the graph of r at the point where t = tt. 

(c) Find the parametric equations of the line tangent to the graph of r when t = tt. 

(d) Sketch a plot of the curve r(t) and its tangent line near the point where t = vr. In 
addition, include a sketch of r'(vr). What is the important role of r'(7r) in this activity? 

<1 

We see that our work in Activity 9.29 can be generalized. Given a differentiable vector-valued 
function r, the tangent line to the curve at the input value a is given by 

L(t) = r(o) + tr'(a). (9.11) 

Here we see that because the tangent line is determined entirely by a given point and direction, 
the point is provided by the function r, evaluated at t = a, while the direction is provided by the 
derivative, r', again evaluated at t = a. Note how analogous the formula for L(£) is to the tangent 
line approximation from single-variable calculus: in that context, for a given function y = f(x) at 
a value x = a, we found that the tangent line can be expressed by the linear function y = L(x) 
whose formula is 

L(x ) = /(a) + f'(a)(x - a). 

Equation (9.11) for the tangent line L(t) to the vector-valued function r(t) is nearly identical. In¬ 
deed, because there are multiple parameterizations for a single line, it is even possible 13 to write 
the parameterization as 

L(f) = r(a) + (t — a)r 7 (a). (9.12) 

As we will learn more in Chapter 10, a smooth surface in 3-space is also locally linear. That 
means that the surface will look like a plane, which we call its tangent plane, as we zoom in on 
the graph. It is possible to use tangent lines to traces of the surface to generate a formula for the 
tangent plane; see Exercise 3 at the end of this section for more details. 


Integrating a Vector-Valued Function 

Recall from single variable calculus that an antiderivative of a function / of the independent vari¬ 
able x is a function F that satisfies F'{x) = f(x). We then defined the indefinite integral f f(x) dx 

12 You can sketch the graph with Wolfram Alpha, the applet at http : //gvsu . edu/s/LR, or some other appropriate 
technology. 

13 In Equation (9.11), L(0) = r (a), so the line's parameterization "starts" at t = 0. When we write the parameterization 
in the form of Equation (9.12), L(a) = r(a), so the line's parameterization "starts" at t = a. 
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to be the general antiderivative of /. Recall that the general antiderivative includes an added con¬ 
stant C in order to indicate that the general antiderivative is in fact an entire family of functions. 
We can do the similar work with vector-valued functions. 


Definition 9.11. An antiderivative of a vector-valued function r is a vector-valued function R 
such that 

R'(f) = r (t). 

The indefinite integral f r(t) dt of a vector-valued function r is the general antiderivative of r 
and represents the collection of all antiderivatives of r. 


The same reasoning that allows us to differentiate a vector-valued function componentwise 
applies to integrating as well. Recall that the integral of a sum is the sum of the integrals and 
also that we can remove constant factors from integrals. So, given r(t) = x(t)i + y(t )j + z(t) k, it 
follows that we can integrate componentwise. Expressed more formally. 


If r(i) = x(t)i + y(f)j + z(t)k, then 


/ 


r(t) dt 


x(t) dt i + / y(t) dt j + 



k. 


In light of being able to integrate and differentiate componentwise with vector-valued func¬ 
tions, we can solve many problems that are analogous to those we encountered in single-variable 
calculus. For instance, recall problems where we were given an object moving along an axis with 
velocity function v and an initial position s(0). In that context, we were able to differentiate v in 
order to find acceleration, and integrate v and use the initial condition in order to find the position 
function s. In the following activity, we explore similar ideas with vector-valued functions. 

Activity 9.30. 

Suppose a moving object in space has its velocity given by 

v(f) = (-2 sin(2f))i + (2 cos(f))j + ^1 - k - 

A graph of the position of the object for times t in [—0.5, 3] is shown in Figure 9.58. Suppose 
further that the object is at the point (1.5, —1,0) at time t = 0. 

(a) Determine a (t), the acceleration of the object at time t. 

(b) Determine r(f ), position of the object at time t. 

(c) Compute and sketch the position, velocity, and acceleration vectors of the object at time 
t = 1, using Figure 9.58. 

(d) Finally, determine the vector equation for the tangent line, L(t), that is tangent to the 
position curve at t = 1. 
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Figure 9.58: The position graph for the function in Activity 9.30. 
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Projectile Motion 

Any time that an object is launched into the air with a given velocity and launch angle, the path 
the object travels is determined almost exclusively by the force of gravity. Whether in sports such 
as archery or shotput, in military applications with artillery, or in important fields like firefighting, 
it is important to be able to know when and where a launched projectile will land. We can use 
our knowledge of vector-valued functions in order to completely determine the path traveled by 
an object that is launched from a given position at a given angle from the horizontal with a given 
initial velocity. 



Figure 9.59: Projectile motion. 


Assume we fire a projectile from a launcher and the only force acting on the fired object is the 
force of gravity pulling down on the object. That is, we assume no effect due to spin, wind, or air 
resistance. With these assumptions, the motion of the object will be planar, so we can also assume 
that the motion occurs in two-dimensional space. Suppose we launch the object from an initial 
position (xo, Do ) at an angle 0 with the positive x-axis as illustrated in Figure 9.59, and that we fire 
the object with an initial speed of vq = | v(0)|, where v(t) is the velocity vector of the object at time 
t. Assume g is the positive constant acceleration force due to gravity, which acts to pull the fired 
object toward the ground (in the negative y direction). Note particularly that there is no external 
force acting on the object to move it in the x direction. 

We first observe that since gravity only acts in the downward direction and that the accelera¬ 
tion due to gravity is constant, the acceleration vector is (0, —g). That is, a(t) = (0, —g). We may 
use this fact about acceleration, together with the initial position and initial velocity in order to 
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fully determine the position r (t) of the object at time t. In Exercise 5, you can work through the 
details to show that the following general formula holds. 

If an object is launched from a point (xo,2/o) with initial velocity vq at an angle 9 with the 
horizontal, then the position of the object at time t is given by 

r(t) = (v 0 cos (9)t + x 0 , -| t 2 + v 0 sin(9)t + yo) • 

This assumes that the only force acting on the object is the acceleration g due to gravity. 


Summary 

• If r is a vector-valued function, then the derivative of r is defined by 

r (i + h) — r (t) 


r'(t) = lim 

/ l ->0 


h 


for those values of t at which the limit exists, and is computed componentwise by the formula 

r (t) = x'(t) i + y'{t )j + z'(t) k 


for those values of t at which x, y, and z are differentiable, where r(t) = x(t)i + y(t) j + z(t)k. 

• The derivative r' ( t ) of the vector-valued function r tells us the instantaneous rate of change of 
r with respect to time, t, which can be interpreted as a direction vector for the line tangent to 
the graph of r at the point r(t), or also as the instantaneous velocity of an object traveling along 
the graph defined by r (t) at time t. 

• An antiderivative of r is a vector-valued function R such that R'(f) = r (t). The indefinite 
integral f r(t) dt of a vector-valued function r is the general antiderivative of r (which is a 
collection of all of the antiderivatives of r, with any two antiderivatives differing by at most a 
constant vector). Moreover, if r(t) = x(t)i + y(t) j + z(t) k, then 


/ 


r(i) dt 


x(t) dt) i + / y(t) dt j + 



k. 


• If an object is launched from a point (xo,yo) with initial velocity vo at an angle 9 with the 
horizontal, then the position of the object at time t is given by 

r(f) = (v 0 cos (9)t + x 0 , -1 1 2 + vo sm(9)t + y 0 ) , 

provided that that the only force acting on the object is the acceleration g due to gravity. 
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Exercises 

1. Compute the derivative of each of the following functions in two different ways: (1) use the 
rules provided in the theorem stated just after Activity 9.27, and (2) rewrite each given function 
so that it is stated as a single function (either a scalar function or a vector-valued function with 
three components), and differentiate component-wise. Compare your answers to ensure that 
they are the same. 

(a) r(f) = sin(f)(2t, t 2 , arctan(f)) 

(b) s(f) = r(2 t ), where r(t) = (t + 2, ln(t), 1). 

(c) r (t) = (cos(f),sin(f), t) ■ (— sin(f),cos(f), 1) 

(d) r(t) = (cos(f), sin(f), t) x (— sin(f), cos(f), 1) 


2. Consider the two vector-valued functions given by 

r(t) = (t+l,co B (|t), r U^ 


and 


w(s) = sin . 


(a) Determine the point of intersection of the curves generated by r (t) and w(.s). To do so, 
you will have to find values of a and b that result in r (a) and w(6) being the same vector. 

(b) Use the value of a you determined in (a) to find a vector form of the tangent line to r (t) 
at the point where t = a. 

(c) Use the value of b you determined in (a) to find a vector form of the tangent line to w(s) 
at the point where s = b. 

(d) Suppose that z = f(x,y) is a function that generates a surface in three-dimensional 
space, and that the curves generated by r (t) and w(s) both lie on this surface. Note 
particularly that the point of intersection you found in (a) lies on this surface. In addi¬ 
tion, observe that the two tangent lines found in (b) and (c) both lie in the tangent plane 
to the surface at the point of intersection. Use your preceding work to determine the 
equation of this tangent plane. 


3. In this exercise, we determine the equation of a plane tangent to the surface defined by f(x, y) = 
yjx 2 + y 2 at the point (3,4, 5). 

(a) Find a parameterization for the x = 3 trace of /. What is a direction vector for the line 
tangent to this trace at the point (3,4, 5)? 

(b) Find a parameterization for the y = 4 trace of /. What is a direction vector for the line 
tangent to this trace at the point (3,4, 5)? 

(c) The direction vectors in parts (a) and (b) form a plane containing the point (3,4,5). 
What is a normal vector for this plane? 
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(d) Use your work in parts (a), (b), and (c) to deterring an equation for the tangent plane. 
Then, use appropriate technology to draw the graph of / and the plane you determined 
on the same set of axes. What do you observe? (We will discuss tangent planes in more 
detail in Chapter 10.) 


4. For each given function r, determine f r(t) dt. In addition, recalling the Fundamental Theorem 
of Calculus for functions of a single variable, also evaluate (J r(t) dt for each given function r. 
Is the resulting quantity a scalar or a vector? What does it measure? 


(a) r(t) 

(b) r(t) 

(c) r(t) 


cos(f), 


t + 1 


, te 


(cos(3i), sin(2t), t) 

(i + * M ’TTfi) 


5. In this exercise, we develop the formula for the position function of a projectile that has been 
launched at an initial speed of |vo| and a launch angle of 0. Recall that a (f) = (0, —g) is the 
constant acceleration of the projectile at any time t. 

(a) Find all velocity vectors for the given acceleration vector a. When you anti-differentiate, 
remember that there is an arbitrary constant that arises in each component. 

(b) Use the given information about initial speed and launch angle to find vo, the initial 
velocity of the projectile. You will want to write the vector in terms of its components, 
which will involve sin(0) and cos(0). 

(c) Next, find the specific velocity vector function v for the projectile. That is, combine 
your work in (a) and (b) in order to determine expressions in terms of |vo| and 6 for the 
constants that arose when integrating. 

(d) Find all possible position vectors for the velocity vector v(i) you determined in (c). 

(e) Let r(t) denote the position vector function for the given projectile. Use the fact that the 
object is fired from the position (.To, yo) to show it follows that 

r(t) = ^|v 0 | cos (0)t + t 0 , -|f 2 + |v 0 | sin (6)t + y 0 ) • 

6. A central force is one that acts on an object so that the force F is parallel to the object's position 
r. Since Newton's Second Law says that an object's acceleration is proportional to the force 
exerted on it, the acceleration a of an object moving under a central force will be parallel to 
its position r. For instance, the Earth's acceleration due to the gravitational force that the sun 
exerts on the Earth is parallel to the Earth's position vector as shown in Figure 9.60. 

(a) If an object of mass m is moving under a central force, the angular momentum vector 
is defined to be L = mr x v. Assuming the mass is constant, show that the angular 
momentum is constant by showing that 
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Earth 



Figure 9.60: A central force. 

(b) Explain why L • r = 0. 

(c) Explain why we may conclude that the object is constrained to lie in the plane passing 
through the origin and perpendicular to L. 
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9.8 Arc Length and Curvature 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• How can a definite integral be used to measure the length of a curve in 2- or 3-space? 

• Why is arc length useful as a parameter? 

• What is the curvature of a curve? 


Introduction 

Given a space curve, there are two natural geometric questions one might ask: how long is the 
curve and how much does it bend? In this section, we answer both questions by developing 
techniques for measuring the length of a space curve as well as its curvature. 

Preview Activity 9.8. In earlier investigations, we have used integration to calculate quantities 
such as area, volume, mass, and work. We are now interested in determining the length of a space 
curve. 

Consider the smooth curve in 3-space defined by the vector-valued function r, where 

r (t) = (x(t),y(t),z(t)) = (cos(f), sin(t), f) 

for t in the interval [0, 2-k\. Pictures of the graph of r are shown in Figure 9.61. We will use the 
integration process to calculate the length of this curve. In this situation we partition the interval 
[0, 27 t] into n subintervals of equal length and let 0 = to < h < t 2 < ■ ■ ■ < t n = b be the endpoints 
of the subintervals. We then approximate the length of the curve on each subinterval with some 
related quantity that we can compute. In this case, we approximate the length of the curve on 
each subinterval with the length of the segment connecting the endpoints. Figure 9.61 illustrates 
the process in three different instances using increasing values of n. 



Figure 9.61: Approximating the length of the curve with n = 3, n = 6, and n = 9. 
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(a) Write a formula for the length of the line segment that connects the endpoints of the curve 
on the ith subinterval i. t-,]. (This length is our approximation of the length of the curve 
on this interval.) 

(b) Use your formula in part (a) to write a sum that adds all of the approximations to the 
lengths on each subinterval. 

(c) What do we need to do with the sum in part (b) in order to obtain the exact value of the 
length of the graph of r(i) on the interval [0, 2tt]1 


CXI 


Arc Length 

Consider a smooth curve in 3-space that is parametrically described by the vector-valued function 
r defined by r(t) = (x(t),y(t), z(t)). Preview Activity 9.8 shows that to approximate the length of 
the curve defined by r(t) as the values of t run over an interval [a, b], we partition the interval [a, b] 
into n subintervals of equal length At, with a = to < h < ■ ■ ■ < t n = b as the endpoints of the 
subintervals. On each subinterval, we approximate the length of the curve by the length of the line 
segment connecting the endpoints. The points on the curve corresponding to t = i,_i and t = t, 
are (x(U- 1 ), y(U- 1 ), z(U- 1 )) and (x(ti), y(U), z{ti)), respectively, so the length of the line segment 
connecting these points is 

y 7 (x(U) - x(U- 1 )) 2 + (y(U) - y(ti-i)) 2 + (z(U) - z{U- 1 )) 2 . 

Now we add all of these approximations together to obtain an approximation to the length L of 
the curve: 

n 

L « ^2 'J - X {U- 1 )) 2 + (y(U) - y(U- 1 )) 2 + (z(u) - z(ti- 1 )) 2 . 

2—1 

We now want to take the limit of this sum as n goes to infinity, but in its present form it might be 
difficult to see how. We first introduce At by multiplying by , and see that 


L « ^2 V 7 i x {ti) - x (ti- 1 )) 2 + (y{ti) - y{ti- 1 )) 2 + (z(ti) - z(ti- 1 )) 2 

2—1 

n f _ A f 

= v 7 (xiu) - x(ti- 1 )) 2 + (y(ti) - y(ti- 1 )) 2 + (z(ti) - z(L_i )) 2 — 

2—1 

n f _ A f 

= V 7 (x(ti) - x{ti_ i )) 2 + (y(ti) - y(ti- 1 )) 2 + (z(ti) - z(U- 1)) 2 ^=== 

V( At r 


@®®@ 
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To get the difference quotients under the radical, we use properties of the square root function to 
see further that 


L 


E 




- x{ti- 1 )) 2 + - y(U- 1 )) 2 + (z(ti) 


- x(tj- 1 ) \ 2 / y(tj) - y(tj- 1 ) \ 2 

At J \ At j 


- Z(ti- 1 ) 2 ] 


(Af) s 


z(tj) - z(tj- 1 ) 

At 


At 

At. 


Recall that as n —> oo we also have At —> 0. Since 


x(ti)-x(ti- 1 ) 

lim --- 

At-»o At 




lim 

Ai-s-0 


y(tj) - y(tj- 1 ) 
At 


y'(t), and 


z(U) - z(U- 1 ) 

nm --- 

At—>0 At 


z'(t) 


we see that 


L = lim ) 

n—>oc A—/ 
i —1 

n 

lim > 

At—>0 z —'' 


x(ti) - x(U-i)\ fy(ti)-y(ti- i)V , { z(ti) - z(U- 1 ) 


At 


+ 


At 


+ 


At 


At 


x(ti) - x(ti-i)\ (y(ti)-y{ti- i)V f z[ti) - z(ti-i) 


1=1 


At 


+ 


At 


+ 


At 


At 


= / V ( x'{t )) 2 + ( y'(t )) 2 + ( z’(t )) 2 dt. 


(9.13) 


Noting further that 

\r'(t)\ = V(x'(t)) 2 + {y'(t)) 2 + (V(t)) 2 , 

we can rewrite (9.13) in a more succinct form as follows. 


If r(t) defines a smooth curve C on an interval [a, b\, then the length L of C is given by 


L= j |r'(t)| dt. (9.14) 

J a 


Note that formula (9.14) applies to curves in any dimensional space. Moreover, this formula 
has a natural interpretation: if r(t) records the position of a moving object, then r'U) is the object's 
velocity and |r'(i)| its speed. Formula (9.14) says that we simply integrate the speed of an object 
traveling over the curve to find the distance traveled by the object, which is the same as the length 
of the curve, just as in one-variable calculus. 

Activity 9.31. 

Here we calculate the arc length of two familiar curves. 

(a) Use Equation (9.14) to calculate the circumference of a circle of radius r. 
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(b) Find the exact length of the spiral defined by r(i) = (cos(f), sin(f), t) on the interval 

[0,24 


<1 

We can adapt the arc length formula to curves in 2-space that define y as a function of x as the 
following activity shows. 

Activity 9.32. 

Let y = f(x) define a smooth curve in 2-space. Parameterize this curve and use Equation (9.14) 
to show that the length of the curve defined by / on an interval [a, b] is 



V 1 + [f'(t)] 2 dt. 




Parameterizing With Respect To Arc Length 


In addition to helping us to find the length of space curves, the expression for the length of a 
curve enables us to find a natural parametrization of space curves in terms of arc length, as we 
now explain. 

Shown below in Figure 9.62 is a portion of the parabola y = x 2 / 2. Of course, this space curve 
may be parametrized by the vector-valued function r defined by r(i) = (t,t 2 / 2) as shown on the 
left, where we see the location at a few different times t. Notice that the points are not equally 
spaced on the curve. 

A more natural parameter describing the points along the space curve is the distance traveled 
s as we move along the parabola starting at the origin. For instance, the right side of Figure 9.62 
shows the points corresponding to various values of s. We call this an arc length parametrization. 




Figure 9.62: The parametrization r(f) (left) and a reparametrization by arc length. 
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To see that this is a more natural parametrization, consider an interstate highway cutting across 
a state. One way to parametrize the curve defined by the highway is to drive along the highway 
and record our position at every time, thus creating a function r. If we encounter an accident or 
road construction, however, this parametrization might not be at all relevant to another person 
driving the same highway. An arc length parametrization, however, is like using the mile markers 
on the side of road to specify our position on the highway. If we know how far we've traveled 
along the highway, we know exactly where we are. 

If we begin with a parametrization of a space curve, we can modify it to find an arc length 
parametrization, as we now describe. Suppose that the curve is parametrized by the vector-valued 
function r = r(t) where t is in the interval [a, b\. We define the parameter s through the function 

s = L(t ) = f \J ( x'{w )) 2 + ( y'{w )) 2 + (z'{w )) 2 dw, 

J a 


which measures the length along the curve from r(a) to r(t). 
The Fundamental Theorem of Calculus shows us that 


— = L'(t) = y/ ( x'{t )) 2 + (y'{t )) 2 + (U(t )) 2 = \r'(t)\ (9.15) 

and so 

m = f 

J a 

If we assume that r'(t) is never 0, then L'(t) > 0 for all t and s = L(t ) is always increasing. This 
should seem reasonable: unless we stop, the distance traveled along the curve increases as we 
move along the curve. 

Since s = L(t ) is an increasing function, it is invertible, which means we may view the time t 
as a function of the distance traveled; that is, we have the relationship t = L~ 1 (s). We then obtain 
the arc length parametrization by composing r(t) with t = L~ 1 (s) to obtain r(s). Let's illustrate 
this with an example. 




dw. 


Example 9.1. Consider a circle of radius 5 in 2-space centered at the origin. We know that we can 
parameterize this circle as 

r(t) = (5 cos(t), 5 sin(t)), 

where t runs from 0 to 27 t. We see that r'(t) = (—5 sin(t), 5 cos(t)), and hence |r'(i)| = 5. It then 
follows that 

s = L(t ) = f |r 
Jo 

Since s = L(t) = 5 1, we may solve for t in terms of s to obtain t = L _1 (s) = s/5. We then find the 
arc length parametrization by composing 


'{w)\dw= / 5 dw = 5t. 
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More generally, for a circle of radius a centered at the origin, a similar computation shows that 


r(s) = 




(9.16) 


is an arc length parametrization. 


Notice that equation (9.15) shows that 


dr dr ds dr , 

dt ds dt ds 


dr 


1 dr 

ds 


r '(t) dt 


which means that we move along the curve with unit speed when we parameterize by arc length. 
This is clearly seen in Example 9.1 where |r'(s)| = 1. It follows that the parameter s is the distance 
traveled along the curve, as shown by: 


L(s) 



dw 



= s. 


Activity 9.33. 

In this activity we parameterize a line in 2-space in terms of arc length. Consider the line with 
parametric equations 

x(t) = xq + at and y(t) = yo + bt. 


(a) To write t in terms of s, evaluate the integral 

s = L(t ) = f \J(x'(w)) 2 + ( y'(w)) 2 dw 
Jo 

to determine the length of the line from time 0 to time t. 

(b) Use the formula from (a) for s in terms of t to write t in terms of s. Then explain why a 
parameterization of the line in terms of arc length is 

x(s) = xn -|- s and y(s) = yn 4- , s. (9.17) 


A little more complicated example is the following. 
Example 9.2. Let us parameterize the curve defined by 

r(t) = (t 2 .|( 3/2 .4t) 



<1 
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for t > 0 in terms of arc length. To write t in terms of s we find s in terms of t: 

s(t ) = [ y/ ( x'(w )) 2 + ( y'{w )) 2 + (z'(w)) 2 dw 

Jo 

= j yj (2 w) 2 + {Aw 1 / 2 ) 2 + (4) 2 dw 

rt 

= / yj 4 w 2 + 16u? + 16 dw 
Jo 

= 2 y/(w + 2) 2 dw 

do 

= 2 / w + 2 dw 

Jo 

t 

= (w 2 + 4w) 


o 


= r + At. 


Since t > 0, we can solve the equation s = t 2 + At (or t 2 + At — s = 0) for t to obtain t = 
— 2 + y/A + s. So we can parameterize our curve in terms of arc length by 

r(s) = ^(“ 2 + V4 + s ) 2 , ^ (-2 + \/4 + s) 3/2 ,4 (-2 + y/A + s)^ . 


-4+yi6+4s 

2 


These examples illustrate a general method. Of course, evaluating an arc length integral and 
finding a formula for the inverse of a function can be difficult, so while this process is theoretically 
possible, it is not always practical to parameterize a curve in terms of arc length. However, we can 
guarantee that such a parameterization exists, and this observation plays an important role in the 
next section. 


Curvature 


For a smooth space curve, the curvature measures how fast the curve is bending or changing direc¬ 
tion at a given point. For example, we expect that a line should have zero curvature everywhere, 
while a circle (which is bending the same at every point) should have constant curvature. Circles 
with larger radii should have smaller curvatures. 

To measure the curvature, we first need to describe the direction of the curve at a point. We 
may do this using a continuously varying tangent vector to the curve, as shown in 9.63. The di¬ 
rection of the curve is then determined by the angle 4> this tangent vector makes with a horizontal 
vector, as shown in 9.64. 

Informally speaking, the curvature will be the rate at which the angle 4> is changing as we move 
along the curve. Of course, this rate of change will depend on how we move along the curve; if 
we move with a greater speed along the curve, then <f> will change more rapidly. This is why the 
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Figure 9.63: Tangent vectors to an ellipse. 


Figure 9.64: Angles of tangent vectors. 


speed limit is sometimes lowered when we enter a curve on a highway. In other words, the rate of 
change of 4> will depend on the parametrization we use to describe the space curve. To eliminate 
this dependence on the parametrization, we choose to work with an arc length parametrization 
r(s), which means we move along the curve with unit speed. 

Using an arc length parametrization r(s), we define the tangent vector T(s) = r'(s), and note 
that |T(s)| = 1; that is, T(s) is a unit tangent vector. We then have T(s) = (cos(</>(s)), sin(</>(s))), 
which means that 

^ T = /-sincos(0(s))^\ = (-sin(0(s)), cos(0(s)))^. 
as \ as as / as 


Therefore 


dT 

ds 


|(-sin(0(s)), cos(^(s)))| 


d(j> 


d<j> 

ds 


ds 


This observation leads us to adopt 


Definition 9.12. If C is a smooth space curve and s is an arc length parameter for C, then the 
curvature, k, u of C is 


k = k(s) 


dT 

ds 


Example 9.3. We should expect that the curvature of a line is 0 everywhere. To show that our 
definition of curvature measures this correctly in 2-space, recall that (9.17) gives us the arc length 
parameterization 


x(s) = Xq + 


yj a 2 + b 2 

of a line. So the unit tangent vector is 

T (s) = 


s and y(s) = y 0 + 


V a 2 + b 2 


Va 2 + b 2 ' Va 2 + b 2 
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and, since T(s) is constant, we have 



= 0 


as expected. 


Activity 9.34. 


Recall that an arc length parameterization of a circle in 2-space of radius a centered at the origin 
is, from (9.16), 


r(s) = 




Show that the curvature of this circle is the constant K What can you say about the relationship 
between the size of the radius of a circle and the value of its curvature? Why does this make 
sense? 


<1 

The definition of curvature relies on our ability to parameterize curves in terms of arc length. 
Since we have seen that finding an arc length parametrization can be difficult, we would like to be 
able to express the curvature in terms of a more general parametrization r(i). 

To begin, we need to describe the vector T, which is a vector tangent to the curve having unit 
length. Of course, the velocity vector r' it) is tangent to the curve; we simply need to normalize its 
length to be one. This means that we may take 


m 


At) 

r/ wr 


Then the curvature of the curve defined by r is 


dT 


dT dt 
dt ds 


I dT I 
I dt I 

I — I 
I dt I 


T'(*)| 

\r'(t)\ ■ 


This last formula allows us to use any parameterization of a curve to calculate its curvature. 
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There is another useful formula, given below, whose derivation is left for the exercises. 


If r is a vector-valued function defining a smooth space curve C, and if r'(t) is not zero and if 
r "(t) exists, then the curvature k of C satisfies 


• k = n(t) 


|T'(*)I 

r'W! 


_ |r'(t)xr"(t)| 
- |r'(t)|3 


Activity 9.35. 

Use one of the two formulas for k in terms of t to help you answer the following questions. 

2 2 

(a) The ellipse ^ = 1 has parameterization 

r (t) = (acos(f), 6sin(t)). 

Find the curvature of the ellipse. Assuming 0 < b < a, at what points is the curvature 
the greatest and at what points is the curvature the smallest? Does this agree with your 
intuition? 

(b) The standard helix has parameterization r(t) = cos(t)i + sin(t)j + tk. Find the curvature 
of the helix. Does the result agree with your intuition? 


<1 

The curvature has another interpretation. Recall that the tangent line to a curve at a point is the 
line that best approximates the curve at that point. The curvature at a point on a curve describes 
the circle that best approximates the curve at that point. Remembering that a circle of radius a 
has curvature 1/a, then the circle that best approximates the curve near a point on a curve whose 
curvature is k has radius 1 /k and will be tangent to the tangent line at that point. This circle, called 
the osculating circle of the curve at the point, is shown in Figure 9.65 for a portion of a parabola. 


Summary 

• The integration process shows that the length L of a smooth curve defined by r(t) on an interval 


[a, b] is 


L = 


|r 7 (t)| dt. 


• Arc length is useful as a parameter because when we parameterize with respect to arc length, 
we eliminate the role of speed in our calculation of curvature and the result is a measure that 
depends only on the geometry of the curve and not on the parameterization of the curve. 

• We define the curvature k of a curve in 2- or 3-space to be the rate of change of the unit tangent 
vector with respect to arc length, or 

(IT 
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Figure 9.65: The osculating circle 


Exercises 

1. Consider the moving particle whose position at time t in seconds is given by the vector-valued 
function r defined by r(t) = 5ti + 4 sin(3f)j + 4 cos(3t)k. Use this function to answer each of the 
following questions. 

(a) Find the unit tangent vector, T(t), to the spacecurve traced by r(t) at time t. Write one 
sentence that explains what T(f) tells us about the particle's motion. 

(b) Determine the speed of the particle moving along the spacecurve with the given pa¬ 
rameterization. 

(c) Find the exact distance traveled by the particle on the time interval [0,7r/3]. 

(d) Find the average velocity of the particle on the time interval [0, vr/3]. 

(e) Determine the parameterization of the given curve with respect to arc length. 

2. Let y = f(x) define a curve in the plane. We can consider this curve as a curve in three-space 
with ^-coordinate 0. 

(a) Find a parameterization of the form r(t) = (x(t),y(t), z(t)) of the curve y = f(x) in 
three-space. 

(b) Use the formula 

_ |r'(f) X r"(f)| 

K |r'(f )| 3 

to show that 

\f"(x)\ 

[i + (/'W)W 

3. Consider the single variable function defined by y = 4x 2 — x 3 . 
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(a) Find a parameterization of the form r(t) = (x(t),y(t)) that traces the curve y = 4x 2 — x 3 
on the interval from x = — 3 to x = 3. 

(b) Write a definite integral which, if evaluated, gives the exact length of the given curve 
from x = —3 to x = 3. Why is the integral difficult to evaluate exactly? 

(c) Determine the curvature, n{t), of the parameterized curve. (Exercise 2 might be useful 
here.) 

(d) Use appropriate technology to approximate the absolute maximum and minimum of 
n(t) on the parameter interval for your parameterization. Compare your results with 
the graph of y = 4.x 2 — x 3 . How do the absolute maximum and absolute minimum of 
n(t) align with the original curve? 

4. Consider the standard helix parameterized by r (t) = cos(f)i + sin(t)j + fk. 

(a) Recall that the unit tangent vector, T(£), is the vector tangent to the curve at time t that 
points in the direction of motion and has length 1. Find T(t). 

(b) Explain why the fact that |T(t)| = 1 implies that T and T' are orthogonal vectors for 
every value of t. (Hint: note that T ■ T = |T| 2 = 1, and compute ^[T ■ T].) 

(c) For the given function r with unit tangent vector T(f) (from (a)), determine N(t) = 

|T'(t)| T ^' 

(d) What geometric properties does N(i) have? That is, how long is this vector, and how is 
it situated in comparison to T(t)? 

(e) Let B(£) = T(£) x N(f), and compute B(£) in terms of your results in (a) and (c). 

(f) What geometric properties does B (t) have? That is, how long is this vector, and how is 
it situated in comparison to T(f) and N(£)? 

(g) Sketch a plot of the given helix, and compute and sketch T(7r/2), N(7 t/ 2), and B (vr/2). 

5. In this exercise we verify the curvature formula 

_ |r'(f) X r"(t)| 

|r'(f)| 3 ‘ 


(a) Explain why 




ds 
dt' 


(b) Use the fact that T(f) = ppyy and |r'(£)| = to explain why 
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(c) The Product Rule shows that 


d 2 s 


ds rril / 


1 (t) = + - T (t) 


dt 


Explain why 


r'(i)xr"(i)= (^V(T(t)xT'(t)). 


(d) In Exercise 4 we showed that |T(f)| = 1 implies that T(i) is orthogonal to T'(t) for every 
value of t. Explain what this tells us about |T(i) x T'(f)| and conclude that 


I r '(t) x r "(t)| = (||) 2 |T'(0I- 


(e) Finally, use the fact that k = K Ml to verify that 


K = 


|r'(t) x r"(*)| 
|r'(t)| 3 
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Chapter 10 

Derivatives of Multivariable Functions 


10.1 Limits 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• What do we mean by the limit of a function / of two variables at a point ( a,b )? 

• What techniques can we use to show that a function of two variables does not have a limit 
at a point (a, b )? 

• What does it mean for a function / of two variables to be continuous at a point (a, b)? 


Introduction 

In this section, we will study limits of functions of several variables, with a focus on limits of 
functions of two variables. In single variable calculus, we studied the notion of limit, which turned 
out to be a critical concept that formed the basis for the derivative and the definite integral. In this 
section we will begin to understand how the concept of limit for functions of two variables is 
similar to what we encountered for functions of a single variable. The limit will again be the 
fundamental idea in multivariable calculus, and we will use this notion of the limit of a function 
of several variables to define the important concept of differentiability later in this chapter. We 
have already seen its use in the derivatives of vector-valued functions in Section 9.7. 

Let's begin by reviewing what we mean by the limit of a function of one variable. We say that 
a function / has a limit L as x approaches a provided that we can make the values f(x) as close to 
L as we like by taking x sufficiently close (but not equal) to a. We denote this behavior by writing 

lim f(x) = L. 

X^HL 
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Preview Activity 10.1. We investigate the limits of several different functions by working with 
tables and graphs. 

(a) Consider the function / defined by 

f(x) = 3 - x. 

Complete the following table of values. 


X 

/(*) 

- 0.2 


- 0.1 


0.0 


0.1 


0.2 



What does the table suggest regarding lim x _>o f(x)7 

(b) Explain how your results in (a) are reflected in Figure 10.1. 



Figure 10.1: The graph of f(x) = 3 - x. 


(c) Next, consider 

aix) = 

|z| 

Complete the following table of values near x = 0, the point at which g is not defined. 


X 

g ( x ) 

- 0.1 


- 0.01 


- 0.001 


0.001 


0.01 


0.1 



What does this suggest about lim x ^ 0 g{x)7 
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Figure 10.2: The graph of g(x) = A. 


(d) Explain how your results in (c) are reflected in Figure 10.2. 

(e) Now, let's examine a function of two variables. Let 

f(x,y) = 3 - x - 2y 

and complete the following table of values. 


x\y 

-1 

-0.1 

0 

0.1 

1 

-1 






-0.1 






0 






0.1 






1 







What does the table suggest about lim^^^^o) f(x. y)? 

(f) Explain how your results in (e) are reflected in Figure 10.3. Compare this limit to the limit 
in part (a). How are the limits similar and how are they different? 


(g) Finally, consider 


g(x,y) = 


2 xy 


2 i 2 ’ 

ar + 


which is not defined at (0,0), and complete the following table of values of g(x, y). 


x\y 

-1 

-0.1 

0 

0.1 

1 

-1 






-0.1 






0 



— 



0.1 






1 







What does this suggest about g(x, y)? 
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Figure 10.3: At left, the graph of f(x, y) = 3 — x — 2y; at right, its contour plot. 


(h) Explain how your results are reflected in Figure 10.4. Compare this limit to the limit in 
part (b). How are the results similar and how are they different? 



CXI 


Limits of Functions of Two Variables 

In Preview Activity 10.1, we recalled the notion of limit from single variable calculus and saw that 
a similar concept applies to functions of two variables. Though we will focus on functions of two 
variables, for the sake of discussion, all the ideas we establish here are valid for functions of any 
number of variables. In a natural followup to our work in Preview Activity 10.1, we now formally 
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define what it means for a function of two variables to have a limit at a point. 


Definition 10.1. Given a function / = f(x. y), we say that / has limit L as ( x , y) approaches (a, b ) 
provided that we can make f(x, y) as close to L as we like by taking (x, y) sufficiently close 
(but not equal) to (a, b ). We write 

lim f(x, y) = L. 

( x,y)-y(a,b) 


To investigate the limit of a single variable function, lim x ^ 0 fix), we often consider the behav¬ 
ior of / as x approaches a from the right and from the left. Similarly, we may investigate limits of 
two-variable functions, lim/ Xi2/ )_q afe ) f(x, y) by considering the behavior of / as (x. y) approaches 
(a, b) from various directions. This situation is more complicated because there are infinitely many 
ways in which (x, y) may approach (a, b). In the next activity, we see how it is important to con¬ 
sider a variety of those paths in investigating whether or not a limit exists. 

Activity 10.1. 

Consider the function /, defined by 


f(x,y) 


y 

\/x 2 + y 2 ' 


whose graph is shown below in Figure 10.5 



Figure 10.5: The graph of f{x, y) = 


(a) Is / defined at the point (0,0)? What, if anything, does this say about whether / has a 
limit at the point (0,0)? 

(b) Values of / (to three decimal places) at several points close to (0,0) are shown in the 
table below. 













98 


10.1. LIMITS 


x\y 

-1 

-0.1 

0 

0.1 

1 

-1 

-0.707 

— 

0 

— 

0.707 

-0.1 

— 

-0.707 

0 

0.707 

— 

0 

-1 

-1 

— 

1 

1 

0.1 

— 

-0.707 

0 

0.707 

— 

1 

-0.707 

— 

0 

— 

0.707 


Based on these calculations, state whether / has a limit at (0,0) and give an argument 
supporting your statement. (Hint: The blank spaces in the table are there to help you 
see the patterns.) 

Now let's consider what happens if we restrict our attention to the x-axis; that is, con¬ 
sider what happens when y = 0. What is the behavior of f(x, 0) as x — > 0? If we 
approach (0,0) by moving along the x-axis, what value do we find as the limit? 

What is the behavior of / along the line y = x when x > 0; that is, what is the value of 
f(x, x ) when x > 0? If we approach (0,0) by moving along the line y = x in the first 
quadrant (thus considering f(x, x) as x —? 0, what value do we find as the limit? 

In general, if lim^^^o) /(u?/) = L, then f(x,y ) approaches L as (x,y) approaches 
(0,0), regardless of the path we take in letting (x, y) —> (0,0). Explain what the last two 
parts of this activity imply about the existence of lim^ )2/ )_q 0j o) f(x, y). 

Shown below in Figure 10.6 is a set of contour lines of the function /. What is the 
behavior of f(x,y) as (x,y) approaches (0,0) along any straight line? How does this 
observation reinforce your conclusion about the existence of lim( a . j j / )_q 0 ,o) f( x - y) from 
the previous part of this activity? (Hint: Use the fact that a non-vertical line has equation 
y = rax for some constant m.) 




(c) 

(d) 

(e) 

(f) 
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As we have seen in Activity 10.1, if we approach (a, b ) along two different paths and find that 
f(x,y ) has two different limits, we can conclude that lim( X)y w( a ^ f(x,y ) does not exist. This is 
similar to the one-variable example /(x) = x/\x\ as shown in 10.7; lim.,^o f(x) does not exist 
because we see different limits as x approaches 0 from the left and the right. 





















1 

















Figure 10.7: The graph of g(x ) = A. 


As a general rule, we have 

If f(x, y ) has two different limits as (x, y) approaches (a, b ) along two different paths, then 
lim( XiJ ,)^.(a i6 ) f(x, y) does not exist. 


As the next activity shows, studying the limit of a two-variable function / by considering the 
behavior of / along various paths can require subtle insights. 

Activity 10.2. 

Let's consider the function g defined by 

f ^ x 2 y 

9{x,y) = ——2 
x 4 + y z 

and investigate the limit lim^ ^^o.o) 9{ x i u)- 

(a) What is the behavior of g on the x-axis? That is, what is g(x. 0) and what is the limit of 
g as (x, y) approaches (0,0) along the x-axis? 

(b) What is the behavior of g on the y-axis? That is, what is g(0, y) and what is the limit of 
g as (x, y) approaches (0,0) along the y-axis? 

(c) What is the behavior of g on the line y = mx ? That is, what is g(x, mx) and what is the 
limit of g as (x, y) approaches (0,0) along the line y = mx? 

(d) Based on what you have seen so far, do you think lim^ ^^o 0 ) g{x, y) exists? If so, what 
do you think its value is? 

(e) Now consider the behavior of g on the parabola y = x 2 ? What is g(x, x 2 ) and what is 
the limit of g as (x, y) approaches (0, 0) along this parabola? 
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(f) State whether the limit lim^ 0 ) g{x, y) exists or not and provide a justification of 
your statement. 


<1 

This activity shows that we need to be careful when studying the limit of a two-variable func¬ 
tions by considering its behavior along different paths. If we find two different paths that result in 
two different limits, then we may conclude that the limit does not exist. However, we can never 
conclude that the limit of a function exists only by considering its behavior along different paths. 

Generally speaking, concluding that a limit lim^^^^) f(x. y) exists requires a more careful 
argument. For example, let's consider the function / defined by 

f(x,y) = 2 l 2 
x z + y z 


and ask whether lim( XiJ/ )_ > .(o,o) /(*, y) exists. 

Note that if either x or y is 0, then f(x, y) 

We will therefore argue that 

lim 

(z,2/)-K0.0) ’ 

by showing that we can make f(x,y ) as close to 0 as we wish by taking (x, y) sufficiently close 
(but not equal) to (0,0). In what follows, we view x and y as being real numbers that are close, but 
not equal, to 0. 

Since 0 < x 2 , we have 

2 ^ 2 , 2 
y <x +y , 

which implies that 


x 2 + y 2 

Multiplying both sides by x 2 and observing that f(x. y) > 0 for all (x. y) gives 


= 0. Therefore, if / has a limit at (0,0), it must be 0. 


f(x,y) = 0, 


0 < f(x, y) 


= x 


x 2 + y 2 



< x 


This shows that we can make f(x, y) as close to 0 as we like by taking x sufficiently close to 0 (for 
this example, it turns out that we don't even need to worry about making y close to 0). Therefore, 


x 2 y 

lim = 

0 , 0 ) x z + y z 


= 0 . 


In spite of the fact that these two most recent examples illustrate some of the complications 
that arise when studying limits of two-variable functions, many of the properties that are familiar 











10.1. LIMITS 


101 


from our study of single variable functions hold in precisely the same way For instance. 


Properties of Limits. Let / = f(x, y) and g = g(x, y) be functions so that lim f(x , y) and 

(x,y)^(a,b) 

lim g(x, y) both exist. Then 

(x,y)^(a,b) 

1. lim x = a and lim y = b. 

(x,y)-t(a,b) (x,y)-y(a,b) 


lim cf(x, y) = c ( lim f(x, y) ) for any scalar c, 

{x,y)^(a,b) \(x,y)^(a,b) J 


2 . 


3. lim [f(x,y)±g(x,y)] = lim f(x,y)± lim g(x,y), 

(x,y)->(a,b) (x,y)->(a,b) (x,y)->(a,b) 


4. lim [f(x,y)-g(x,y)\=l lim f(x,y) • lim g(x,y)), 
{x,y)-¥(a,b) \(x,y)-t(a,b) J \(x,y)^>-(a,b) 


5. lim /(^^) _ if 


(x,y)—*(a,b) g{x, y) lim( a . !j ,)_ ) .( 0i6 ) g(x, y ) (x,y)^(a,b) 


lim g(x, y) / 0. 


We can use these properties and results from single variable calculus to verify that many limits 
exist. For example, these properties show that the function / defined by 

f(x. y) = 3 x 2 y 3 + 2 xy 2 — 2>x + 1 

has a limit at every point (a, b ) and, moreover, 

, I™ f(x,y) = f(a, b). 

(x,y)->(a,b) 

The reason for this is that polynomial functions of a single variable have limits at every point. 


Continuity 

Recall that a function / of a single variable x is said to be continuous at x = a provided that the 
following three conditions are satisfied: 

1. f(a) exists, 

2. lim x ^ a f(x) exists, and 

3. lim x _>. a f(x) = /(a). 

Using our understanding of limits of multivariable functions, we can define continuity in the same 
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way. 

Definition 10.2. A function / = f(x, y ) is continuous at the point (a, b ) provided that 

1. / is defined at the point (a, b), 

2. lim f(x, y) exists, and 

(x,y)-^(a,b) 

3. lim f(x,y) = f(a,b). 

(a :,y)-*(a,b) 


For instance, we have seen that the function / defined by f(x, y) = 3 x 2 y 3 + 2 xy 2 — 3x + 1 
is continous at every point. And just as with single variable functions, continuity has certain 
properties that are based on the properties of limits. 

Properties of Continuity. Let / and g be functions of two variables that are continuous at the 
point (a, b). Then 

1. cf is continuous at (o, b ) for any scalar c 

2. f + g is continuous at (a, b) 

3. f — g is continuous at (a, b) 

4. fg is continuous at (a, b) 

5. ^ is continuous at (a, b ) if g(a, 6) / 0 


Using these properties, we can apply results from single variable calculus to decide about 
continuity of multivariable functions. For example, the coordinate functions / and g defined by 
f(x, y) = x and g(x, y) = y are continuous at every point. We can then use properties of continuity 
listed to conclude that every polynomial function in x and y is continuous at every point. For 
example, g(x, y) = x 2 and h(x, y) = y 3 are continuous functions, so their product f(x, y) = x 2 y ‘ is 
a continuous multivariable function. 

Summary 

• A function / = f(x,y) has a limit L at a point (a, b ) provided that we can make f(x,y) as close 
to L as we like by taking (x, y) sufficiently close (but not equal) to (a, b). 

• If (x, y) has two different limits as (x. y) approaches (a, b) along two different paths, we can 

conclude that /( x, y) does not exist. 

• Properties similar to those for one-variable functions allow us to conclude that many limits 
exist and to evaluate them. 
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• A function / = f(x, y ) is continuous at a point (a, b ) in its domain if / has a limit at (a, 6 ) and 

f{a, b) = lim f(x,y). 

{x,y)^{a,b) 


Exercises 

1. Consider the function / defined by f(x, y) 


xy 

x 2 + y 2 + 1 


(a) What is the domain of /? 

(b) Evaluate limit of / at (0,0) along the following paths: x = 0, y = 0, y = x, and y = x 2 . 

(c) What do you conjecture is the value of lim f(x,y)7 

(z.3/)-K 0,0) 

(d) Is / continuous at (0, 0)? Why or why not? 

(e) Use appropriate technology to sketch both surface and contour plots of / near (0,0). 
Write several sentences to say how your plots affirm your findings in (a) - (d). 

xy 

2. Consider the function g defined by g(x, y) = ——- 77 . 

x z + y z 


(a) What is the domain of g ? 

(b) Evaluate limit of g at (0, 0) along the following paths: x = 0,y = x, and y = 2x. 

(c) What can you now say about the value of lim g(x,y)7 

(*>!/)-»•( 0 , 0 ) 

(d) Is g continuous at (0, 0)? Why or why not? 

(e) Use appropriate technology to sketch both surface and contour plots of g near (0,0). 
Write several sentences to say how your plots affirm your findings in (a) - (d). 


3. For each of the following prompts, provide an example of a function of two variables with the 
desired properties (with justification), or explain why such a function does not exist. 

(a) A function p that is defined at (0,0), but lim p(x, y) does not exist. 

(*,y)-K 0 , 0 ) 

(b) A function q that does not have a limit at (0, 0), but that has the same limiting value 
along any line y = mx as x 0. 

(c) A function r that is continuous at (0,0), but lim r(x, y) does not exist. 

(®>y)-K 0 , 0 ) 


(d) A function s such that 


lim s(x,x) = 3 and lim s(x,2x) = 6, 

(x,x)—>(0,0) ( x,2x)—>(0,0 ) 


for which lim s(x, y) exists. 

(*,!/)—>•( 0 , 0 ) 
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(e) A function t that is not defined at (1,1) but lim 

(*.!/)-►( 1 , 1 ) 


t(x,y) does exist. 
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10.2 First-Order Partial Derivatives 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 


• How are the first-order partial derivatives of a function / of the independent variables x 
and y defined? 

• Given a function / of the independent variables x and y, what do the first-order partial 
derivatives and tell us about /? 


Introduction 


The derivative plays a central role in first semester calculus because it provides important infor¬ 
mation about a function. Thinking graphically, for instance, the derivative at a point tells us the 
slope of the tangent line to the graph at that point. In addition, the derivative at a point also pro¬ 
vides the instantaneous rate of change of the function with respect to changes in the independent 
variable. 

Now that we are investigating functions of two or more variables, we can still ask how fast the 
function is changing, though we have to be careful about what we mean. Thinking graphically 
again, we can try to measure how steep the graph of the function is in a particular direction. 
Alternatively, we may want to know how fast a function's output changes in response to a change 
in one of the inputs. Over the next few sections, we will develop tools for addressing issues such 
as these these. Preview Activity 10.2 explores some issues with what we will come to call partial 
derivatives. 


Preview Activity 10.2. Let's return to the function we considered in Preview Activity 9.1. Suppose 
we take out a $18,000 car loan at interest rate r and we agree to pay off the loan in t years. The 
monthly payment, in dollars, is 


M (r, t ) 


1500r 


1 “ ( X + I2) 


- 12 1 ' 


(a) What is the monthly payment if the interest rate is 3% so that r = 0.03, and we pay the 
loan off in t = 4 years? 

(b) Suppose the interest rate is fixed at 3%. Express M as a function / off alone using r = 0.03. 
That is, let f(t) = M(0.03, t). Sketch the graph of / on the left of Figure 10.8. Explain the 
meaning of the function /. 

(c) Find the instantaneous rate of change /'(4) and state the units on this quantity. What 
information does /'(4) tell us about our car loan? What information does /'(4) tell us 
about the graph you sketched in (b)? 


feaiSMMI 
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Figure 10.8: The graphs of /(f) = M(0.03, t) and g(r) = M(r, 4). 


(d) Express M as a function of r alone, using a fixed time of t = 4. That is, let g(r) = M(r, 4). 
Sketch the graph of g on the right of Figure 10.8. Explain the meaning of the function g. 

(e) Find the instantaneous rate of change g'{ 0.03) and state the units on this quantity. What 
information does g'{ 0.03) tell us about our car loan? What information does r/(0.03) tell us 
about the graph you sketched in (d)? 




First-Order Partial Derivatives 


In Section 9.1, we studied the behavior of a function of two or more variables by considering the 
traces of the function. Recall that in one example, we considered the function / defined by 


f(x,y ) 


x 2 sin(2y) 
32 


which measures the range, or horizontal distance, in feet, traveled by a projectile launched with an 
initial speed of x feet per second at an angle y radians to the horizontal. The graph of this function 
is given again on the left in Figure 10.9. Moreover, if we fix the angle y = 0.6, we may view the 
trace fix, 0.6) as a function of x alone, as seen at right in Figure 10.9. 

Since the trace is a one-variable function, we may consider its derivative just as we did in the 
first semester of calculus. With y = 0.6, we have 


/(x, 0.6) 


sin(1.2) 
32” 


-x 


and therefore 


When x = 150, this gives 


T[/(x, 0.6)] 
ax 


sin(1.2) 

16 


x. 


d 

dx 


[/(x, 0.6)] U=i 50 


sin(1.2) 


150 


8.74 feet per feet per second, 


@©@© 


16 
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Figure 10.9: The trace with y = 0.6. 


which gives the slope of the tangent line shown on the right of Figure 10.9. Thinking of this deriva¬ 
tive as an instantaneous rate of change implies that if we increase the initial speed of the projectile 
by one foot per second, we expect the horizontal distance traveled to increase by approximately 
8.74 feet if we hold the launch angle constant at 0.6 radians. 

By holding y fixed and differentiating with respect to x, we obtain the first-order partial deriva¬ 
tive of f with respect to x. Denoting this partial derivative as f x , we have seen that 


/*(150,0.6) 


d_ 

dx 


f(x, 0 . 6 ) 1^=150 


lim ^ 15 ° + k ’ °' 6 ^ “ ^ 150, °' 6 ^ 
h — >-0 h 


More generally, we have 


fx(a,b ) = lim 
h — 


f(a + h,b)- f(a,b ) 
h 


provided this limit exists. 

In the same way, we may obtain a trace by setting, say, x = 150 as shown in Figure 10.10. This 
gives 

150 2 

/(150, y) = -^-sin(2 y), 

and therefore 

d 150 2 

-[/(150, y)] = —cos(2y). 

If we evaluate this quantity at y = 0.6, we have 


d_ 

dy 


[/(150, y)]\ y=o e 


150 2 


cos(1.2) « 509.5 feet per radian. 


Once again, the derivative gives the slope of the tangent line shown on the right in Figure 10.10. 
Thinking of the derivative as an instantaneous rate of change, we expect that the range of the 
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Figure 10.10: The trace with x = 150. 


projectile increases by 509.5 feet for every radian we increase the launch angle y if we keep the 
initial speed of the projectile constant at 150 feet per second. 

By holding x fixed and differentiating with respect to y, we obtain the first-order partial deriva¬ 
tive of f with respect to y. As before, we denote this partial derivative as f y and write 


/y(l50, 0.6) 


^-/(iSO, 2/)| y=0 .6 = lim 
dy h — 


/(150,0.6 + h) — /(150, 0.6) 
h 


As with the partial derivative with respect to x, we may express this quantity more generally at 
an arbitrary point (a, b). To recap, we have now arrived at the formal definition of the first-order 
partial derivatives of a function of two variables. 


Definition 10.3. The first-order partial derivatives of / with respect to x and y at a point (a, b) 
are, respectively. 


fx(a,b) 

fy(a,b) 


lim f(a + h,b) - f(a, b) 
h ->0 h 


and 


lim f(a,b + h) - f(a, b) 
h- 5-0 h 


provided the limits exist. 


Activity 10.3. 

Consider the function / defined by 


f(x,y) 


9 

xy 

X -\-l 
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at the point (1,2). 

(a) Write the trace f(x, 2) at the fixed value y = 2. On the left side of Figure 10.11, draw the 
graph of the trace with y = 2 indicating the scale and labels on the axes. Also, sketch 
the tangent line at the point x = 1. 


Figure 10.11: Traces of f(x, y) = 

(b) Find the partial derivative f x ( 1,2) and relate its value to the sketch you just made. 

(c) Write the trace /(l, y) at the fixed value x = 1. On the right side of Figure 10.11, draw 
the graph of the trace with x = 1 indicating the scale and labels on the axes. Also, sketch 
the tangent line at the point y = 2. 

(d) Find the partial derivative f y ( 1,2) and relate its value to the sketch you just made. 


<1 


As these examples show, each partial derivative at a point arises as the derivative of a one- 
variable function defined by fixing one of the coordinates. In addition, we may consider each par¬ 
tial derivative as defining a new function of the point (x, y ), just as the derivative f (x ) defines a 
new function of x in single-variable calculus. Due to the connection between one-variable deriva¬ 
tives and partial derivatives, we will often use Leibniz-style notation to denote partial derivatives 
by writing 


di 

dx 


(a,b) = f x (a, b), 


and 


di 

dy 


(a,b) = f y (a, b). 


To calculate the partial derivative f x 
Leibniz notation, observe that 


we hold y fixed and thus we treat y as a constant. In 


^(.x) = 1 and ^-( y ) = 0. 
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To see the contrast between how we calculate single variable derivatives and partial derivatives, 
and the difference between the notations ^ [ ] and ^ [ ], observe that 


and 


-^-[3x 2 - 2x + 3] = 3-^-[x 2 ] - 2-f-[x] + -f-[3] = 3 • 2x - 2, 
ax ax ax ax 

d . 2 i <9 r 2l <9 r n <9 r n 

— [x y - xy + 2y\ = y — [x J - y^[xj + ^[2yJ = y • 2x - y 


Thus, computing partial derivatives is straightforward: we use the standard rules of single vari¬ 
able calculus, but do so while holding one (or more) of the variables constant. 

Activity 10.4. 

(a) If we have the function / of the variables x and y and we want to find the partial deriva¬ 
tive f x , which variable do we treat as a constant? When we find the partial derivative 
f y , which variable do we treat as a constant? 

(b) If /(x, y) = 3x 3 — 2 x 2 y 5 , find the partial derivatives f x and f y . 

2 

xy 

(c) If fix , y) = —-, find the partial derivatives f x and 

x + 1 

(d) If g(r, s ) = rs cos(r), find the partial derivatives g r and g s . 

(e) Assuming f(w, x, y) = (6 w + 1) cos(3x 2 + 4xy 3 + y), find the partial derivatives f w , f x , 
and f y . 


(f) Find all possible first-order partial derivatives of q{x, t. z) = - 


xTz 


t ^.3 


+ X 


2 ' 
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Interpretations of First-Order Partial Derivatives 

Recall that the derivative of a single variable function has a geometric interpretation as the slope of 
the line tangent to the graph at a given point. Similarly, we have seen that the partial derivatives 
measure the slope of a line tangent to a trace of a function of two variables as shown in Figure 
10 . 12 . 

Now we consider the first-order partial derivatives in context. Recall that the difference quo¬ 
tient h a +D f{a) £ or a f unc q on f of a single variable x at a point where x = a tells us the average 
rate of change of / over the interval [a, a + h], while the derivative f (a) tells us the instantaneous 
rate of change of / at x = a. We can use these same concepts to explain the meanings of the partial 
derivatives in context. 

Activity 10.5. 

The speed of sound C traveling through ocean water is a function of temperature, salinity and 
depth. It may be modeled by the function 

C = 1449.2 + 4.6T - 0.055T 2 + 0.00029T 3 + (1.34 - 0.01T)(5 - 35) + 0.016D. 
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Figure 10.12: Tangent lines to two traces of the range function. 


Here C is the speed of sound in meters/second, T is the temperature in degrees Celsius, S is 
the salinity in grams/liter of water, and D is the depth below the ocean surface in meters. 

(a) State the units in which each of the partial derivatives, Ct, Cg and Cjj, are expressed 
and explain the physical meaning of each. 

(b) Find the partial derivatives Ct, Cs and Co- 

(c) Evaluate each of the three partial derivatives at the point where T = 10, S = 35 and 
D = 100. What does the sign of each partial derivatives tell us about the behavior of the 
function C at the point (10,35,100)? 


<1 


Using tables and contours to estimate partial derivatives 

Remember that functions of two variables are often represented as either a table of data or a con¬ 
tour plot. In single variable calculus, we saw how we can use the difference quotient to approxi¬ 
mate derivatives if, instead of an algebraic formula, we only know the value of the function at a 
few points. The same idea applies to partial derivatives. 

Activity 10.6. 

The wind chill, as frequently reported, is a measure of how cold it feels outside when the wind 
is blowing. In Table 10.1, the wind chill w, measured in degrees Fahrenheit, is a function of the 
wind speed v, measured in miles per hour, and the ambient air temperature T, also measured 
in degrees Fahrenheit. We thus view w as being of the form w = u>(v,T). 

(a) Estimate the partial derivative tc„(20, —10). What are the units on this quantity and 
what does it mean? 
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v\T 

-30 

-25 

-20 

-15 

-10 

-5 

0 

5 

10 

15 

20 

5 

-46 

-40 

-34 

-28 

-22 

-16 

-11 

-5 

1 

7 

13 

10 

-53 

-47 

-41 

-35 

-28 

-22 

-16 

-10 

-4 

3 

9 

15 

-58 

-51 

-45 

-39 

-32 

-26 

-19 

-13 

-7 

0 

6 

20 

-61 

-55 

-48 

-42 

-35 

-29 

-22 

-15 

-9 

-2 

4 

25 

-64 

-58 

-51 

-44 

-37 

-31 

-24 

-17 

-11 

-4 

3 

30 

-67 

-60 

-53 

-46 

-39 

-33 

-26 

-19 

-12 

-5 

1 

35 

-69 

-62 

-55 

-48 

-41 

-34 

-27 

-21 

-14 

-7 

0 

40 

-71 

-64 

-57 

-50 

-43 

-36 

-29 

-22 

-15 

-8 

-1 


Table 10.1: Wind chill as a function of wind speed and temperature. 


(b) Estimate the partial derivative wt{ 20, —10). What are the units on this quantity and 
what does it mean? 

(c) Use your results to estimate the wind chill u>(18, —10). 

(d) Use your results to estimate the wind chill w( 20, —12). 

(e) Use your results to estimate the wind chill u>(18, —12). 

<1 

Activity 10.7. 

Shown below in Figure 10.13 is a contour plot of a function /. The value of the function along 
a few of the contours is indicated to the left of the figure. 



Figure 10.13: A contour plot of /. 


(a) Estimate the partial derivative f x {— 2,-1). 

(b) Estimate the partial derivative f y (— 2,-1). 

(c) Estimate the partial derivatives f x (— 1, 2) and f y (—l, 2). 
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(d) Locate one point (x, y ) where the partial derivative f x (x, y ) = 0. 

(e) Locate one point ( x, y) where f x {x, y) < 0. 

(f) Locate one point (x, y) where f y (x, y) > 0. 

(g) Suppose you have a different function g, and you know that g( 2 , 2) = 4, g x ( 2 , 2) > 0, 
and g y ( 2, 2) > 0. Using this information, sketch a possibility for the contour g(x. y) = 4 
passing through (2, 2) on the left side of Ligure 10.14. Then include possible contours 

g{x,y) = 3 and g(x,y) = 5. 


y 















X 


12 3 4 


y 















X 


l 2 3 4 


Ligure 10.14: Plots for contours of g and h. 

(h) Suppose you have yet another function h, and you know that h( 2, 2) = 4, h x ( 2, 2) < 0, 
and h y ( 2, 2) > 0. Using this information, sketch a possible contour h{x. y) = 4 passing 
through (2, 2) on the right side of Ligure 10.14. Then include possible contours h(x, y) = 
3 and h(x, y) = 5. 
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Summary 

• If / = f(x, y) is a function of two variables, there are two first order partial derivatives of /: the 
partial derivative of / with respect to x, 


df 

dx 


{x,y) = f x (x,y) = lim 

/i—>o 


f(x + h,y) - f(x, y) 
h 


and the partial derivative of / with respect to y, 


df 

-^-(x,y) = fy(x,y) = lim 
oy h—>o 


f(x,y + h) - f(x,y) 
h 


where each partial derivative exists only at those points (x, y) for which the limit exists. 

• The partial derivative f x (a, b) tells us the instantaneous rate of change of / with respect to x at 
(x, y) = (a, b) when y is fixed at b. Geometrically, the partial derivative f x (a, b) tells us the slope 
of the line tangent to the y = b trace of the function / at the point (a, b, f(a, b)). 
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• The partial derivative f y {a, b) tells us the instantaneous rate of change of / with respect to y at 
(x, y ) = (a, b ) when x is fixed at a. Geometrically, the partial derivative f y (a, b ) tells us the slope 
of the line tangent to the x = a trace of the function / at the point (a, b, f(a. b)). 


Exercises 

1. The Heat Index, 7, (measured in apparent degrees F) is a function of the actual temperature T 
outside (in degrees F) and the relative humidity 77 (measured as a percentage). A portion of 
the table which gives values for this function, I = I (T, 77), is reproduced below: 


Ti\H 

70 

75 

80 

85 

90 

106 

109 

112 

115 

92 

112 

115 

119 

123 

94 

118 

122 

127 

132 

96 

125 

130 

135 

141 


(a) State the limit definition of the value It( 94, 75). Then, estimate It{ 94,75), and write 
one complete sentence that carefully explains the meaning of this value, including its 
units. 

(b) State the limit definition of the value 7# (94, 75). Then, estimate 7#(94, 75), and write 
one complete sentence that carefully explains the meaning of this value, including its 
units. 

(c) Suppose you are given that 7^(92,80) = 3.75 and 7#(92, 80) = 0.8. Estimate the val¬ 
ues of 7(91, 80) and 7(92, 78). Explain how the partial derivatives are relevant to your 
thinking. 

(d) On a certain day, at 1 p.m. the temperature is 92 degrees and the relative humidity is 
85%. At 3 p.m., the temperature is 96 degrees and the relative humidity 75%. What 
is the average rate of change of the heat index over this time period, and what are the 
units on your answer? Write a sentence to explain your thinking. 


2. Let f(x, y) = \xy 2 represent the kinetic energy in Joules of an object of mass x in kilograms 
with velocity y in meters per second. Let (a, b) be the point (4, 5) in the domain of /. 

(a) Calculate f x (a, b). 

(b) Explain as best you can in the context of kinetic energy what the partial derivative 


f x (a, b) = lim 

h —^0 


f{a + h,b) - f(a,b) 
h 


tells us about kinetic energy, 
(c) Calculate/ y (a, 6). 


@Q®@ 
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(d) Explain as best you can in the context of kinetic energy what the partial derivative 


f y (a,b) = lim 
h—> 0 


f(a,b + h) - f(a,b ) 
h 


tells us about kinetic energy. 

(e) Often we are given certain graphical information about a function instead of a rule. We 
can use that information to approximate partial derivatives. For example, suppose that 
we are given a contour plot of the kinetic energy function (as in Figure 10.15) instead 
of a formula. Use this contour plot to approximate f x { 4,5) and f y ( 4,5) as best you can. 
Compare to your calculations from earlier parts of this exercise. 



Figure 10.15: The graph of f(x, y) = \xy 2 . 


3. 


The temperature on an unevenly heated metal plate positioned in the first quadrant of the x-y 
plane is given by 


C(x,y) 


25 xy + 25 

(x - l) 2 + (y - l) 2 + r 


Assume that temperature is measured in degrees Celsius and that x and y are each measured 
in inches. (Note: At no point in the following questions should you expand the denominator 

of C(x,y).) 


(a) Determine |( X)J/ ) and (a . iW) . 

(b) If an ant is on the metal plate, standing at the point (2, 3), and starts walking in a direc¬ 
tion parallel to the y axis, at what rate will the temperature he is experiencing change? 
Explain, and include appropriate units. 

(c) If an ant is walking along the line y = 3, at what instantaneous rate will the temperature 
he is experiencing change when he passes the point (1, 3)? 

(d) Now suppose the ant is stationed at the point (6,3) and walks in a straight line towards 
the point (2,0). Determine the average rate of change in temperature (per unit distance 


@©@© 
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traveled) the ant encounters in moving between these two points. Explain your reason¬ 
ing carefully. What are the units on your answer? 

4. Consider the function / defined by f(x, y) = 8 — x 2 — 3 y 2 . 

(a) Determine f x (x, y ) and f y (x, y). 

(b) Find parametric equations in M 3 for the tangent line to the trace f(x, 1) at x = 2. 

(c) Find parametric equations in M 3 for the tangent line to the trace /(2, y) at y = 1. 

(d) State respective direction vectors for the two lines determined in (b) and (c). 

(e) Determine the equation of the plane that passes through the point (2,1, /(2,1)) whose 
normal vector is orthogonal to the direction vectors of the two lines found in (b) and (c). 

(f) Use a graphing utility to plot both the surface z = 8 — x 2 — 3 y 2 and the plane from (e) 
near the point (2,1). What is the relationship between the surface and the plane? 
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10.3 Second-Order Partial Derivatives 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 


• Given a function / of two independent variables x and y, how are the second-order partial 
derivatives of / defined? 

• What do the second-order partial derivatives f xx , f yy , f xy , and f yx of a function / tell us 
about the function's behavior? 


Introduction 


Recall that for a single-variable function f, the second derivative of / is defined to be the derivative 
of the first derivative. That is, f" (x) = jj- [/' (x)], which can be stated in terms of the limit definition 
of the derivative by writing 


/"(*) 


lim f'(x + h)-f'(x) ' 
h—h 


In what follows, we begin exploring the four different second-order partial derivatives of a func¬ 
tion of two variables and seek to understand what these various derivatives tell us about the 
function's behavior. 


Preview Activity 10.3. Once again, let's consider the function / defined by f(x, y ) 


x 2 sin(2 y) 
32 


that 




Figure 10.16: The range function with traces y = 0.6 and x = 150. 

measures a projectile's range as a function of its initial speed x and launch angle y. The graph of 
this function, including traces with x = 150 and y = 0.6, is shown in Figure 10.16. 

(a) Compute the partial derivative f x and notice that f x itself is a new function of x and y. 
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(b) We may now compute the partial derivatives of f x . Find the partial derivative f xx = (f x ) x 
and evaluate f xx ( 150,0.6). 

(c) Figure 10.17 shows the trace of / with y = 0.6 with three tangent lines included. Explain 
how your result from part (b) of this preview activity is reflected in this figure. 



Figure 10.17: The trace with y = 0.6. 

(d) Determine the partial derivative f y , and then find the partial derivative f yy = [f y )y Eval¬ 
uate / ra (150,0.6). 

1000 

800 

600 

400 

200 

0.25 0.50 0.75 1.00 1.25 



Figure 10.18: More traces of the range function. 

(e) Figure 10.18 shows the trace /(150, y) and includes three tangent lines. Explain how the 
value of f yy ( 150, 0.6) is reflected in this figure. 

(f) Because f x and f y are each functions of both x and y, they each have two partial deriva¬ 
tives. Not only can we compute f xx = (f x ) x , but also f xy = (f x ) y ) likewise, in addition to 

fyy = {fy)y> but also f yx = ( f y ) x . For the range function f(x, y) = - m ^ 2y \ use your earlier 
computations of f x and f y to now determine f xy and f yx . Write one sentence to explain 
how you calculated these "mixed" partial derivatives. 
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CXI 


Second-order Partial Derivatives 

A function / of two independent variables x and y has two first order partial derivatives, f x and 
f y . As we saw in Preview Activity 10.3, each of these first-order partial derivatives has two partial 
derivatives, giving a total of four second-order partial derivatives: 

(f) =A. (df) = aV 

\Jx)x Q x \^Qx J dx 2 ' 

(f) = a_(9l'] = ^f 
\Jy)y dy \dy J dy 2 ' 

(f ) = JL (9£\ = a 2 / 

\Jx)y Qy J dydx’ 

(f) a (df\ = a^ 

\Jy)x q x ydy J dxdy " 

The first two are called unmixed second-order partial derivatives while the last two are called 
the mixed second-order partial derivatives. 

One aspect of this notation can be a little confusing. The notation 

d 2 f = d_ /df\ 

dydx dy \dx J 

means that we first differentiate with respect to x and then with respect to y; this can be expressed 
in the alternate notation f xy = {f x )y However, to find the second partial derivative 

fyx = ( fy)x 

we first differentiate with respect to y and then x. This means that 

d 2 f f d 2 f 

V. 7T - = fxyi and 77 7— = fyx- 
dydx dxdy 

Be sure to note carefully the difference between Leibniz notation and subscript notation and the 
order in which x and y appear in each. In addition, remember that anytime we compute a partial 
derivative, we hold constant the variable(s) other than the one we are differentiating with respect 
to. 

Activity 10.8. 

Find all second order partial derivatives of the following functions. For each partial derivative 
you calculate, state explicitly which variable is being held constant. 

(a) f(x,y) = x 2 y 3 

(b) f(x,y) = y cos(x) 


• fxx - 

• fyy = 

• fxy 

• fyx = 
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(c) g(s, t ) = st 3 + s 4 

(d) How many second order partial derivatives does the function h defined by h(x, y, z) = 
9 x 9 z — xyz 9 + 9 have? Find h xz and h zx . 


< 

In Preview Activity 10.3 and Activity 10.8, you may have noticed that the mixed second-order 
partial derivatives are equal. This observation holds generally and is known as Clairaut's Theo¬ 
rem. 

Clairaut's Theorem. Let / be a function of two variables for which the partial derivatives f xy 
and f yx are continuous near the point (a, b). Then 

fxyiflib') = fy X ((l,b). 


Interpreting the second-order Partial Derivatives 

Recall from single variable calculus that the second derivative measures the instantaneous rate of 
change of the derivative. This observation is the key to understanding the meaning of the second- 
order partial derivatives. 


Z /] \ I K Z , T I Z 



Figure 10.19: The tangent lines to a trace with increasing x. 

Furthermore, we remember that the second derivative of a function at a point provides us 
with information about the concavity of the function at that point. Since the unmixed second- 
order partial derivative f xx requires us to hold y constant and differentiate twice with respect to 
x, we may simply view f xx as the second derivative of a trace of / where y is fixed. As such, f xx 
will measure the concavity of this trace. 

Consider, for example, f(x, y) = sin(;r)e - ^. Figure 10.19 shows the graph of this function along 
with the trace given by y = —1.5. Also shown are three tangent lines to this trace, with increasing 
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x -values from left to right among the three plots in Figure 10.19. 

That the slope of the tangent line is decreasing as x increases is reflected, as it is in one-variable 
calculus, in the fact that the trace is concave down. Indeed, we see that f x (x, y ) = cos(x)e~ y and 
so fxx(x, y) = — s\n{x)e~ y < 0, since e~ y > 0 for all values of y, including y = —1.5. 

In the following activity, we further explore what second-order partial derivatives tell us about 
the geometric behavior of a surface. 

Activity 10.9. 

We continue to consider the function / defined by f(x, y) = sin(x)e _y . 

(a) In Figure 10.20, we see the trace of f(x, y) = sin(x)e -y that has x held constant with 
x = 1.75. Write a couple of sentences that describe whether the slope of the tangent 



Figure 10.20: The tangent lines to a trace with increasing y. 

lines to this curve increase or decrease as y increases, and, after computing f yy (x,y), 
explain how this observation is related to the value of f yy ( 1.75, y). Be sure to address 
the notion of concavity in your response. 

(b) In Figure 10.21, we start to think about the mixed partial derivative, f xy . Here, we 
first hold y constant to generate the first-order partial derivative f x , and then we hold x 
constant to compute f xy . This leads to first thinking about a trace with x being constant, 
followed by slopes of tangent lines in the y-di recti on that slide along the original trace. 
You might think of sliding your pencil down the trace with x constant in a way that 
its slope indicates (f x ) y in order to further animate the three snapshots shown in the 
figure. Based on Figure 10.21, is f xy ( 1.75, —1.5) positive or negative? Why? 

(c) Determine the formula for f xy (x, y), and hence evaluate f xy ( 1.75, —1.5). How does this 
value compare with your observations in (b)? 

(d) We know that / xx (1.75, —1.5) measures the concavity of the y = —1.5 trace, and that 
f yy (1.75, —1.5) measures the concavity of the x = 1.75 trace. What do you think the 
quantity f xy ( 1.75, —1.5) measures? 
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Figure 10.21: The trace of z = f(x, y ) = sin(x)e y with x = 1.75, along with tangent lines in the 
y-direction at three different points. 


(e) On Figure 10.21, sketch the trace with y = —1.5, and sketch three tangent lines whose 
slopes correspond to the value of f yx (x, —1.5) for three different values of x, the mid¬ 
dle of which is x = —1.5. Is f yx ( 1.75,—1.5) positive or negative? Why? What does 
f yx ( 1.75, —1.5) measure? 

<1 

Just as with the first-order partial derivatives, we can approximate second-order partial deriva¬ 
tives in the situation where we have only partial information about the function. 

Activity 10.10. 

As we saw in Activity 10.6, the wind chill w(v, T ), in degrees Fahrenheit, is a function of the 
wind speed, in miles per hour, and the air temperature, in degrees Fahrenheit. Some values of 
the wind chill are recorded in Table 10.2. 


v\T 

-30 

-25 

-20 

-15 

-10 

-5 

0 

5 

10 

15 

20 

5 

-46 

-40 

-34 

-28 

-22 

-16 

-11 

-5 

1 

7 

13 

10 

-53 

-47 

-41 

-35 

-28 

-22 

-16 

-10 

-4 

3 

9 

15 

-58 

-51 

-45 

-39 

-32 

-26 

-19 

-13 

-7 

0 

6 

20 

-61 

-55 

-48 

-42 

-35 

-29 

-22 

-15 

-9 

-2 

4 

25 

-64 

-58 

-51 

-44 

-37 

-31 

-24 

-17 

-11 

-4 

3 

30 

-67 

-60 

-53 

-46 

-39 

-33 

-26 

-19 

-12 

-5 

1 

35 

-69 

-62 

-55 

-48 

-41 

-34 

-27 

-21 

-14 

-7 

0 

40 

-71 

-64 

-57 

-50 

-43 

-36 

-29 

-22 

-15 

-8 

-1 


Table 10.2: Wind chill as a function of wind speed and temperature. 


(a) Estimate the partial derivatives wt( 20,-15), wr(20, —10), and wt{ 20,-5). Use these 
results to estimate the second-order partial wtt{ 20, —10). 
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(b) In a similar way, estimate the second-order partial w vv (20, —10). 

(c) Estimate the partial derivatives wr(20, —10), wt( 25,-10), and u>t( 15, —10), and use 
your results to estimate the partial wt v { 20, —10). 

(d) In a similar way, estimate the partial derivative w v t{ 20, —10). 

(e) Write several sentences that explain what the values wtt{ 20, —10), w vv (20, —10), and 
wtv{ 20, —10) indicate regarding the behavior of w(v, T). 


<1 

As we have found in Activities 10.9 and 10.10, we may think of f xy as measuring the "twist" 
of the graph as we increase y along a particular trace where x is held constant. In the same way, 
f yx measures how the graph twists as we increase x. If we remember that Clairaut's theorem tells 
us that f xy = f yx , we see that the amount of twisting is the same in both directions. This twisting 
is perhaps more easily seen in Figure 10.22, which shows the graph of f(x, y ) = — xy, for which 

fxy = -1- 



Figure 10.22: The graph of f(x, y) = — xy. 


Summary 

• There are four second-order partial derivatives of a function / of two independent variables x 
and y: 

fxx = (fx)x, fxy = (fx)y,fyx = ( fy)x, and f yy = ( fy)y■ 


• The unmixed second-order partial derivatives, f xx and f yy , tell us about the concavity of the 
traces. The mixed second-order partial derivatives, f xy and f yx , tell us how the graph of / 
twists. 


Exercises 

1. Shown in Figure 10.23 is a contour plot of a function / with the values of / labeled on the 
contours. The point (2,1) is highlighted in red. 
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Figure 10.23: A contour plot of f(x. y ). 


(a) Estimate the partial derivatives f x (2, 1) and f y ( 2,1). 

(b) Determine whether the second-order partial derivative f xx ( 2,1) is positive or negative, 
and explain your thinking. 

(c) Determine whether the second-order partial derivative f yy ( 2,1) is positive or negative, 
and explain your thinking. 

(d) Determine whether the second-order partial derivative f xy ( 2,1) is positive or negative, 
and explain your thinking. 

(e) Determine whether the second-order partial derivative f yx ( 2,1) is positive or negative, 
and explain your thinking. 

(f) Consider a function g of the variables x and y for which g x ( 2, 2) > 0 and g xx ( 2, 2) < 0. 
Sketch possible behavior of some contours around (2, 2) on the left axes in Figure 10.24. 


y 
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12 3 4 


y 
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12 3 4 


Figure 10.24: Plots for contours of g and h. 


(g) Consider a function h of the variables x and y for which h x { 2, 2) > 0 and h xy ( 2,2) < 0. 
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Sketch possible behavior of some contour lines around (2, 2) on the right axes in Figure 
10.24. 

2. The Heat Index, I, (measured in apparent degrees F ) is a function of the actual temperature T 
outside (in degrees F) and the relative humidity H (measured as a percentage). A portion of 
the table which gives values for this function, I (T, FI), is reproduced below: 


T \.\H 

70 

75 

80 

85 

90 

106 

109 

112 

115 

92 

112 

115 

119 

123 

94 

118 

122 

127 

132 

96 

125 

130 

135 

141 


(a) State the limit definition of the value Itt (94, 75). Then, estimate Itt (94, 75), and write 
one complete sentence that carefully explains the meaning of this value, including units. 

(b) State the limit definition of the value Ihh( 94, 75). Then, estimate Ihh{ 94, 75), and write 
one complete sentence that carefully explains the meaning of this value, including units. 

(c) Finally, do likewise to estimate Iht( 94,75), and write a sentence to explain the meaning 
of the value you found. 

3. The temperature on a heated metal plate positioned in the first quadrant of the x-y plane is 
given by 

C(x,y) = 25e-( x - 1)2 - fe - 1)3 . 

Assume that temperature is measured in degrees Celsius and that x and y are each measured 
in inches. 

(a) Determine C xx (x, y) and C yy (x, y). Do not do any additional work to algebraically sim¬ 
plify your results. 

(b) Calculate C xx (1.1.1.2). Suppose that an ant is walking past the point (1.1,1.2) along 
the line y = 1.2. Write a sentence to explain the meaning of the value of C xx (l.l, 1.2), 
including units. 

(c) Calculate C yy ( 1.1,1.2). Suppose instead that an ant is walking past the point (1.1,1.2) 
along the line x = 1.1. Write a sentence to explain the meaning of the value of C yy { 1.1,1.2), 
including units. 

(d) Determine C xy (x,y) and hence compute C xy ( 1.1,1.2). What is the meaning of this 
value? Explain, in terms of an ant walking on the heated metal plate. 

4. Let f(x, y) = 8 — x 2 — y 2 and g(x, y) = 8 — x 2 + 4 xy — y 2 . 

(a) Determine f x , f y , f xx , f yy , f xy , and f yx . 

(b) Evaluate each of the partial derivatives in (a) at the point (0,0). 
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(c) What do the values in (b) suggest about the behavior of / near (0, 0)? Plot a graph of / 
and compare what you see visually to what the values suggest. 

(d) Determine g x , 9y, g xx > 9yyr 9xy / <md 9yx- 

(e) Evaluate each of the partial derivatives in (d) at the point (0,0). 

(f) What do the values in (e) suggest about the behavior of g near (0, 0)? Plot a graph of g 
and compare what you see visually to what the values suggest. 

(g) What do the functions / and g have in common at (0, 0)? What is different? What do 
your observations tell you regarding the importance of a certain second-order partial 
derivative? 


5. Let f(x. y) = \xy 2 represent the kinetic energy in Joules of an object of mass x in kilograms 
with velocity y in meters per second. Let (a, b ) be the point (4, 5) in the domain of /. 

< 9 2 / 

(a) Calculate —^ at the point (a, b). Then explain as best you can what this second order 
partial derivative tells us about kinetic energy. 


< 9 2 / 

(b) Calculate at the point (a, b). Then explain as best you can what this second order 


partial derivative tells us about kinetic energy. 

< 9 2 / 

(c) Calculate —— at the point (a, b). Then explain as best you can what this second order 

oyox 

partial derivative tells us about kinetic energy. 

d 2 f 


(d) Calculate 


dxdy 


at the point (a, b). Then explain as best you can what this second order 


partial derivative tells us about kinetic energy. 
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10.4 Linearization: Tangent Planes and Differentials 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• What does it mean for a function of two variables to be locally linear at a point? 

• How do we find the equation of the plane tangent to a locally linear function at a point? 

• What does it mean to say that a multivariable function is differentiable ? 

• What is the differential of a multivariable function of two variables and what are its uses? 


Introduction 

One of the central concepts in single variable calculus is that the graph of a differentiable function, 
when viewed on a very small scale, looks like a line. We call this line the tangent line and measure 
its slope with the derivative. In this section, we will extend this concept to functions of several 
variables. 

Let's see what happens when we look at the graph of a two-variable function on a small scale. 
To begin, let's consider the function / defined by 

f(x,y) = 6 — Y~y 2 , 

whose graph is shown in Figure 10.25. 



-4 


-2 


Figure 10.25: The graph of f(x, y) = 6 — x 2 /2 — y 2 . 

We choose to study the behavior of this function near the point (xq, yo ) = (1,1). In particular, 
we wish to view the graph on an increasingly small scale around this point, as shown in the two 
plots in Figure 10.26 
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Figure 10.26: The graph of f(x, y) = 6 — x 2 /2 — y 2 . 


Just as the graph of a differentiable single-variable function looks like a line when viewed on 
a small scale, we see that the graph of this particular two-variable function looks like a plane, as 
seen in Figure 10.27. In the following preview activity, we explore how to find the equation of this 
plane. 



Figure 10.27: The graph of f(x, y) = 6 — x 2 /2 — y 2 . 

In what follows, we will also use the important fact 1 that the plane passing through (xo, yo . zq ) 
may be expressed in the form z = zq + aix — xq) + b(y — yo), where a and b are constants. 


: As we saw in Section 9.5, the equation of a plane passing through the point ( xo , y o, zo) may be written in the form 
A(x — xo) + B(y — yo) + C(z — zo) = 0. If the plane is not vertical, then C ^ 0, and we can rearrange this and hence 
write C(z — zo) = — A(x — xo) — B(y — yo) and thus 

Z = Z 0 - ^(x — Xo) — B(y-y 0 ) 

= z 0 + a(x - Xo) + b(y - y 0 ) 


where a = — A/C and b = —B/C, respectively. 
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Preview Activity 10.4. Let f(x, y) = 6 — ^ — y 2 , and let (xo, yo) = (L 1). 

(a) Evaluate /(x,y) = 6 — ^ — y 2 and its partial derivatives at (xo,yo); that is, find /(1,1), 
f x ( 1,1), and f y ( 1,1). 

(b) We know one point on the tangent plane; namely, the z-value of the tangent plane agrees 
with the z -value on the graph of f(x, y) = 6 — %— y 2 at the point (xo, yo)- In other words, 
both the tangent plane and the graph of the function / contain the point (xo, yo,zo). Use 
this observation to determine zq in the expression z = zq + a(x — xq) + b(y — yo)- 

(c) Sketch the traces of f(x, y) = 6 — ^ — y 2 for y = yo = 1 and x = xo = 1 below in Figure 
10.28. 


5.0 


4.5 


4.0 


2 = f(- 

'A) 














X 


5.0 


4.5 


4.0 


Z = /( 

,y) 














y 


0.5 1.0 1.5 0.5 1.0 1.5 


Figure 10.28: The traces of /(x, y) with y = yo = 1 and x = xq = 1. 


(d) Determine the equation of the tangent line of the trace that you sketched in the previous 
part with y = 1 (in the x direction) at the point xq = 1. 




Figure 10.29: The traces of f(x, y) and the tangent plane. 
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(e) Figure 10.29 shows the traces of the function and the traces of the tangent plane. Explain 
how the tangent line of the trace of /, whose equation you found in the last part of this 
activity, is related to the tangent plane. How does this observation help you determine the 
constant a in the equation for the tangent plane z = zq + a(x — xq) + b(y — yo)? (Hint: How 
do you think f x (x o, yo) should be related to z x (xq, yo)?) 

(f) In a similar way to what you did in (d), determine the equation of the tangent line of the 
trace with x = 1 at the point yo = 1- Explain how this tangent line is related to the tangent 
plane, and use this observation to determine the constant b in the equation for the tangent 
plane z = zo + a(x — .To) + b(y — yo). (Hint: How do you think f y ( xq, yo) should be related 

to z y (x 0 ,yo)?) 

(g) Finally, write the equation z = zq + a[x — xq) + b(y — yo) of the tangent plane to the graph 

of f(x,y) = 6 — x 2 /2 — y 2 at the point (t 0 , yo) = (LI). 


<1 


The tangent plane 

Before stating the formula for the equation of the tangent plane at a point for a general function 
/ = f(x, y), we need to discuss a mild technical condition. As we have noted, when we look at 
the graph of a single-variable function on a small scale near a point to, we expect to see a line; in 
this case, we say that / is locally linear near xo since the graph looks like a linear function locally 
around to- Of course, there are functions, such as the absolute value function given by /(x) = |x|, 
that are not locally linear at every point. In single-variable calculus, we learn that if the derivative 
of a function exists at a point, then the function is guaranteed to be locally linear there. 

In a similar way, we say that a two-variable function / is locally linear near (xo,yo) provided 
that the graph of / looks like a plane when viewed on a small scale near (xo, yo)- There are, of 
course, functions that are not locally linear at some points (xo, yo)- However, it turns out that if 
the first-order partial derivatives, f x and f y , are continuous near (xo, yo), then / is locally linear at 
(xo, yo) and the graph looks like a plane, which we call the tangent plane, when viewed on a small 
scale. Moreover, when a function is locally linear at a point, we will also say it is differentiable at 
that point. 

If / is a function of the independent variables x and y and both f x and f y exist and are contin¬ 
uous in an open disk containing the point (xq, yo), then / is differentiable at ( xq , yo)- 


So, whenever a function z = f(x,y) is differentiable at a point (xo,yo), it follows that the 
function has a tangent plane at (xo, yo)- Viewed up close, the tangent plane and the function are 
then virtually indistinguishable. In addition, as in Preview Activity 10.4, we find the following 
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general formula for the tangent plane. 

If f(x, y) has continuous first-order partial derivatives, then the equation of the plane tangent 
to the graph of / at the point {x 0 ,y 0 , f(x 0l y 0 )) is 

Z = f(xo,yo) + fx(x 0 ,y 0 )(x - x 0 ) + fy{x 0 ,y 0 )(y - y 0 ). (10.1) 


Finally, one important note about the form of the equation for the tangent plane, z = f(x o , IJo ) + 
fx(xo,yo)(x — xq) + f y (x o, yo)(y — yo)- Say, for example, that we have the particular tangent plane 
z = 7— 2(x— 3)+4(y+l). Observe that we can immediately read from this form that f x (3, —1) = —2 
and f y ( 3, —1) = 4; furthermore, / ;r (8. —1) = —2 is the slope of the trace to both / and the tangent 
plane in the x -di recti on at (—3,1). In the same way, f y ( 3, —1) = 4 is the slope of the trace of both 
/ and the tangent plane in the y-di recti on at (3, —1). 

Activity 10.11. 

Find the equation of the tangent plane to f(x, y) = x 2 y at the point (1, 2). 

<1 


Linearization 

In single variable calculus, an important use of the tangent line is to approximate the value of a 
differentiable function. Near the point xq, the tangent line to the graph of / at x'o is close to the 
graph of / near xq, as shown in Figure 10.30. 




Figure 10.30: The linearization of the single-variable function f(x). 

In this single-variable setting, we let L denote the function whose graph is the tangent line, 
and thus 

L(x) = f(x o) + f'(x 0 )(x - x 0 ) 
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Furthermore, observe that f(x) ~ L(x) near xq. We call L the linearization of /. 

In the same way, the tangent plane to the graph of a differentiable function z = f(x, y ) at 
a point ( xo,yo ) provides a good approximation of f(x,y) near (xoWo)- Here, we define the lin¬ 
earization, L, to be the two-variable function whose graph is the tangent plane, and thus 

L(x,y) = /(x 0 , 2 / 0 ) + fx{x 0 ,y 0 )(x - x 0 ) + f y (x 0 ,yo)(y ~ Vo)- 

Finally, note that f(x, y) « L[x, y) for points near (xq, ya)■ This is illustrated in Figure 10.31. 



Figure 10.31: The linearization of f(x, y). 


Activity 10.12. 

In what follows, we find the linearization of several different functions that are given in alge¬ 
braic, tabular, or graphical form. 

(a) Find the linearization L(x, y) for the function g defined by 

g(x,y) = x 

x z + y z 

at the point (1, 2). Then use the linearization to estimate the value of 5 ( 0 . 8 , 2.3). 

(b) Table 10.3 provides a collection of values of the wind chill w(v, T), in degrees Fahren¬ 
heit, as a function of wind speed, in miles per hour, and temperature, also in degrees 
Fahrenheit. 

Use the data to first estimate the appropriate partial derivatives, and then find the 
linearization L(v,T) at the point (25, —10). Finally, use the linearization to estimate 

w{ 25, -12), ic(23, -10), and w( 23, -12). 

(c) Figure 10.32 gives a contour plot of a differentiable function /. 

After estimating appropriate partial derivatives, determine the linearization L(x,y) at 
the point ( 2 , 1 ), and use it to estimate /( 2 . 2 , 1 ), /( 2 , 0 . 8 ), and /( 2 . 2 , 0 . 8 ). 
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v\T 

-30 

-25 

-20 

-15 

-10 

-5 

0 

5 

10 

15 

20 

5 

-46 

-40 

-34 

-28 

-22 

-16 

-11 

-5 

1 

7 

13 

10 

-53 

-47 

-41 

-35 

-28 

-22 

-16 

-10 

-4 

3 

9 

15 

-58 

-51 

-45 

-39 

-32 

-26 

-19 

-13 

-7 

0 

6 

20 

-61 

-55 

-48 

-42 

-35 

-29 

-22 

-15 

-9 

-2 

4 

25 

-64 

-58 

-51 

-44 

-37 

-31 

-24 

-17 

-11 

-4 

3 

30 

-67 

-60 

-53 

-46 

-39 

-33 

-26 

-19 

-12 

-5 

1 

35 

-69 

-62 

-55 

-48 

-41 

-34 

-27 

-21 

-14 

-7 

0 

40 

-71 

-64 

-57 

-50 

-43 

-36 

-29 

-22 

-15 

-8 

-1 


Table 10.3: Wind chill as a function of wind speed and temperature. 



Figure 10.32: A contour plot of f(x, y ). 


Differentials 

As we have seen, the linearization L(x, y) enables us to estimate the value of f(x, y) for points 
(x, y) near the base point (xq, yo). Sometimes, however, we are more interested in the change in / 
as we move from the base point (xo,yo) to another point (x,y). 

Figure 10.33 illustrates this situation. Suppose we are at the point (xo, yo), and we know the 
value /(.Toj yo) of / at (xo, yo)- If we consider the displacement (Ax, Ay) to a new point (x, y) = 
(xo + Ax, yo + Ay), we would like to know how much the function has changed. We denote this 
change by A/, where 


A/ = f(x,y) - f(x 0 ,y 0 ). 


A simple way to estimate the change Af is to approximate it by df, which represents the change 
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Figure 10.33: The differential df measures the approximate change in f(x, y ). 


in the linearization L(x, y) as we move from ( xq . ijo) to (x, y). This gives 

A/ « df = L(x, y) - f(x 0 , y 0 ) 

= [fix o, y 0 ) + f x (x o, yo)(x - x 0 ) + f y (x 0 , yo)(y ~ Vo)] ~ f(xo,yo) 
= fx(x o, yo)Ax + f y (x o, y 0 )Ay. 


For consistency, we will denote the change in the independent variables as dx 
and thus 

A f ~ df = f x (x o, y 0 ) dx + fy(x 0 , y 0 ) dy. 

Expressed equivalently in Leibniz notation, we have 


df 


dj_ 

dx 


dx + 


9 / 

dy 


dy. 


Ax and dy = Ay, 

( 10 . 2 ) 

(10.3) 


We call the quantities dx, dy, and df differentials, and we think of them as measuring small 
changes in the quantities x, y, and /. Equations (10.2) and (10.3) express the relationship between 
these changes. Equation (10.3) resembles an important idea from single-variable calculus: when y 
depends on x, it follows in the notation of differentials that 


dy = y dx 


<fy_ 

dx 


dx. 


We will illustrate the use of differentials with an example. Suppose we have a machine that 
manufactures rectangles of width x = 20 cm and height y = 10 cm. However, the machine isn't 
perfect, and therefore the width could be off by dx = Ax = 0.2 cm and the height could be off by 
dy = Ay = 0.4 cm. 

The area of the rectangle is 

A{x,y) = xy, 

so that the area of a perfectly manufactured rectangle is A(20,10) = 200 square centimeters. Since 
the machine isn't perfect, we would like to know how much the area of a given manufactured 
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rectangle could differ from the perfect rectangle. We will estimate the uncertainty in the area 
using (10.2), and find that 

A A ~ cLA = A x ( 20,10) dx + A y ( 20,10) dy. 

Since A x = y and A y = x, we have 

A A^dA= 10 dx + 20 dy = 10 • 0.2 + 20 • 0.4 = 10. 

That is, we estimate that the area in our rectangles could be off by as much as 10 square centime¬ 
ters. 

Activity 10.13. 

The questions in this activity explore the differential in several different contexts. 

(a) Suppose that the elevation of a landscape is given by the function h, where we addi¬ 
tionally know that h{ 3,1) = 4.35, h x ( 3,1) = 0.27, and h y ( 3,1) = —0.19. Assume that x 
and y are measured in miles in the easterly and northerly directions, respectively, from 
some base point (0, 0). 

Your GPS device says that you are currently at the point (3,1). However, you know 
that the coordinates are only accurate to within 0.2 units; that is, dx = Ax = 0.2 and 
dy = Ay = 0.2. Estimate the uncertainty in your elevation using differentials. 

(b) The pressure, volume, and temperature of an ideal gas are related by the equation 

P = P{T,V) = 8.31 T/V, 

where P is measured in kilopascals, V in liters, and T in kelvin. Find the pressure 
when the volume is 12 liters and the temperature is 310 K. Use differentials to estimate 
the change in the pressure when the volume increases to 12.3 liters and the temperature 
decreases to 305 K. 

(c) Refer to Table 10.3, the table of values of the wind chill w(v, T), in degrees Fahrenheit, 
as a function of temperature, also in degrees Fahrenheit, and wind speed, in miles per 
hour. 

Suppose your anemometer says the wind is blowing at 25 miles per hour and your 
thermometer shows a reading of —15° degrees. However, you know your thermometer 
is only accurate to within 2° degrees and your anemometer is only accurate to within 
3 miles per hour. What is the wind chill based on your measurements? Estimate the 
uncertainty in your measurement of the wind chill. 


<1 


Summary 

• A function / of two independent variables is locally linear at a point (xo,yo) if the graph of 
/ looks like a plane as we zoom in on the graph around the point (xo,yo)- I n this case, the 
equation of the tangent plane is given by 

z = f(x 0 ,y 0 ) + f x (x 0 ,y 0 )(x - x 0 ) + f y (x 0 , y 0 ){y - yo)- 
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• The tangent plane L(x, y) = /(x 0 , y 0 ) + f x {x 0 , y 0 )(x - x 0 ) + f y (x 0 , y 0 )(y - y 0 ), when considered 
as a function, is called the linearization of a differentiable function / at (xo, yo) and may be used 
to estimate values of f(x, y); that is, f(x, y) ~ L(x, y) for points (x, y) near (xo, yo). 

• A function / of two independent variables is differentiable at (xo, yo) provided that both f x and 
f y exist and are continuous in an open disk containing the point (xo, yo). 

• The differential df of a function / = /(x, y) is related to the differentials dx and dy by 

df = f x {x o, y 0 )dx + f y (x 0 ,y 0 )dy. 

We can use this relationship to approximate small changes in / that result from small changes 
in x and y. 


Exercises 

1. Let / be the function defined by /(x, y) = x 1 / 3 y 1 / 3 , whose graph is shown in Figure 10.34. 



Figure 10.34: The surface for /(x, y) = x 1 / 3 y 1 / 3 . 


(a) Determine 

lim /(° + M) ~ /(0,0) 

h—h 

What does this limit tell us about f x ( 0,0)? 

(b) Note that /(x, y) = /(y, x), and this symmetry implies that f x ( 0, 0) = f y ( 0,0). So both 
partial derivatives of / exist at (0,0). A picture of the surface defined by / near (0,0) is 
shown in Figure 10.34. Based on this picture, do you think / is locally linear at (0, 0)? 
Why? 

(c) Show that the curve where x = y on the surface defined by / is not differentiable at 0. 
What does this tell us about the local linearity of / at (0,0)? 
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2 

(d) Is the function / defined by f(x, y) = -jz+i locally linear at (0,0)? Why or why not? 

2. Let g be a function that is differentiable at (—2, 5) and suppose that its tangent plane at this 
point is given by z = —7 + 4(x + 2) — 3 (y — 5). 

(a) Determine the values of g(— 2,5), g x (— 2,5), and g y (— 2,5). Write one sentence to explain 
your thinking. 

(b) Estimate the value of g(— 1.8,4.7). Clearly show your work and thinking. 

(c) Given changes of dx = —0.34 and dy = 0.21, estimate the corresponding change in g 
that is given by its differential, dg. 

(d) Suppose that another function h is also differentiable at (—2, 5), but that its tangent 
plane at (—2, 5) is given by 3x + 2y — 4z = 9. Determine the values of h(— 2, 5), h x (— 2, 5), 
and h y (— 2, 5), and then estimate the value of h(— 1.8,4.7). Clearly show your work and 
thinking. 

3. In the following questions, we determine and apply the linearization for several different func¬ 
tions. 

(a) Find the linearization L(x, y) for the function / defined by f(x, y) = cos(x)(2e 2y + e -2y ) 
at the point (xo,yo) = (0 j 0)- Hence use the linearization to estimate the value of 
/(0.1, 0.2). Compare your estimate to the actual value of /(0.1, 0.2). 

(b) The Heat Index, I, (measured in apparent degrees F) is a function of the actual temper¬ 
ature T outside (in degrees F) and the relative humidity H (measured as a percentage). 
A portion of the table which gives values for this function, I = I(T, H ), is provided 
below: 


Ti\H -> 

70 

75 

80 

85 

90 

106 

109 

112 

115 

92 

112 

115 

119 

123 

94 

118 

122 

127 

132 

96 

125 

130 

135 

141 


Suppose you are given that 7^(94,75) = 3.75 and /#(94,75) = 0.9. Use this given 
information and one other value from the table to estimate the value of I (93.1, 77) using 
the linearization at (94, 75). Using proper terminology and notation, explain your work 
and thinking. 

(c) Just as we can find a local linearization for a differentiable function of two variables, 
we can do so for functions of three or more variables. By extending the concept of 
the local linearization from two to three variables, find the linearization of the function 
h(x, y, z) = e 2x (y + z 2 ) at the point (xo, yo, zo) = (0,1, —2). Then, use the linearization 
to estimate the value of A(—0.1, 0.9, —1.8). 

4. In the following questions, we investigate two different applied settings using the differential. 
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(a) Let / represent the vertical displacement in centimeters from the rest position of a string 
(like a guitar string) as a function of the distance x in centimeters from the fixed left end 
of the string and y the time in seconds after the string has been plucked. 2 A simple 
model for / could be 

f(x,y) = cos(x)sin(2 y). 

Use the differential to approximate how much more this vibrating string is vertically 
displaced from its position at (a, b) = (f, f) if we decrease a by 0.01 cm and increase 
the time by 0.1 seconds. Compare to the value of / at the point (| — 0.01,|+0.l). 

(b) Resistors used in electrical circuits have colored bands painted on them to indicate the 
amount of resistance and the possible error in the resistance. When three resistors, 
whose resistances are R\, R-2, and Il>„ are connected in parallel, the total resistance R is 
given by 

i _ 1 1 1 

R R\ R2 R3 

Suppose that the resistances are Ri = 2511, /A = 4011, and R 3 = 5011. Find the total 
resistance R. 

If you know each of R.\, II2, and II3 with a possible error of 0.5%, estimate the maximum 
error in your calculation of R. 


2 An interesting video of this can be seen at https : //www . youtube . com/watch?v=TKF6nFzpHBUA. 
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10.5 The Chain Rule 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 

• What is the Chain Rule and how do we use it to find a derivative? 

• How can we use a tree diagram to guide us in applying the Chain Rule? 


Introduction 


In single-variable calculus, we encountered situations in which some quantity z depends on y and, 
in turn, y depends on x. A change in x produces a change in y, which consequently produces a 
change in z. Using the language of differentials that we saw in the previous section, these changes 
are naturally related by 

dz = dy and dy = ^ dx. 
dy dx 

In terms of instantaneous rates of change, we then have 


dz 


dz dy 
dy dx 


dx 


dz 

— dx 
dx 


and thus 

dz dz dy 
dx dy dx 

This most recent equation we call the Chain Ride. 

In the case of a function / of two variables where z = fix. y), it might be that both x and y 
depend on another variable t. A change in t then produces changes in both x and y, which then 
cause z to change. In this section we will see how to find the change in z that is caused by a 
change in t, leading us to multivariable versions of the Chain Rule involving both regular and 
partial derivatives. 


Preview Activity 10.5. Suppose you are driving around in the x-y plane in such a way that your 
position r(f) at time t is given by function 

r(i) = (x(t), y(t)) = (2 - t 2 ,t 3 + 1). 


The path taken is shown on the left of Figure 10.35. 

Suppose, furthermore, that the temperature at a point in the plane is given by 

T(x,y ) = 10 - \x 2 - \y 2 , 

2 5 

and note that the surface generated by T is shown on the right of Figure 10.35. Therefore, as time 
passes, your position (x(t),y(t)) changes, and, as your position changes, the temperature T[x. y) 
also changes. 
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Figure 10.35: At left, your position in the plane; at right, the corresponding temperature. 


(a) The position function r provides a parameterization x = x(t) and y = y(t) of the position 
at time t. By substituting x(t) for x and y(t) for y in the formula for T, we can write 
T = T(x(t),y(t)) as a function of t. Make these substitutions to write T as a function of 
t and then use the Chain Rule from single variable calculus to find . (Do not do any 
algebra to simplify the derivative, either before taking the derivative, nor after.) 

(b) Now we want to understand how the result from part (a) can be obtained from T as a 
multivariable function. Recall from the previous section that small changes in x and y 
produce a change in T that is approximated by 

AT « T x Ax + T y Ay. 


The Chain Rule tells us about the instantaneous rate of change of T, and this can be found 
as 


, AT 
lim —— 
Ai-s-0 At 


= lim 

Ai->0 


T x Ax + TyAy 
At 


(10.4) 


Use equation (10.4) to explain why the instantaneous rate of change of T that results from 
a change in t is 


dT _ dT dx dT dy 
dt dx dt dy dt' 


(10.5) 


(c) Using the original formulas for T, x, and y in the problem statement, calculate all of the 
derivatives in Equation (10.5) (with T x and T y in terms of x and y, and x' and y' in terms 
of t), and hence write the right-hand side of Equation (10.5) in terms of x, y, and t. 


(d) Compare the results of parts (a) and (c). Write a couple of sentences that identify specifi¬ 
cally how each term in (c) relates to a corresponding terms in (a). This connection between 
parts (a) and (c) provides a multivariable version of the Chain Rule. 


@©@© 
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The Chain Rule 

As Preview Activity 10.3 suggests, the following version of the Chain Rule holds in general. 

Let z = fix, y ), where / is a differentiable function of the independent variables x and y, and 
let x and y each be differentiable functions of an independent variable t. Then 

dz dz dx dz dy 
dt dx dt dy dt' 


It is important to note the differences among the derivatives in (10.6). Since z is a function 
of the two variables x and y, the derivatives in the Chain Rule for z with respect to x and y are 
partial derivatives. However, since x = x(t) and y = y{t) are functions of the single variable t, 
their derivatives are the standard derivatives of functions of one variable. When we compose z 
with x(t) and y(t), we then have z as a function of the single variable t, making the derivative of z 
with respect to t a standard derivative from single variable calculus as well. 

To understand why this Chain Rule works in general, suppose that some quantity z depends 
on x and y so that 

dz dz 

dz = — dx + — dy. (10.7) 

dx dy 

Next, suppose that x and y each depend on another quantity t, so that 


, dx 
dx = — dt 
dt 


and 


dy = t dt 


( 10 . 8 ) 


Combining Equations (10.7) and (10.8), we find that 


dz 


dz dx dzdy 
dx dt ^ dy dt 


dz 

dt 


dt, 


which is the Chain Rule in this particular context, as expressed in Equation 10.6. 

Activity 10.14. 

In the following questions, we apply the recently-developed Chain Rule in several different 
contexts. 


(a) Suppose that we have a function 2 defined by z(x, y) = x 2 + xy 3 . In addition, suppose 
that x and y are restricted to points that move around the plane by following a circle of 
radius 2 centered at the origin that is parameterized by 


x(t) = 2cos(t), and y(t) = 2sin(i). 

Use the Chain Rule to find the resulting instantaneous rate of change . 
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(b) Suppose that the temperature on a metal plate is given by the function T with 

T(x,y) = 100- (x 2 + 4 y 2 ), 

where the temperature is measured in degrees Fahrenheit and x and y are each mea¬ 
sured in feet. 

i. Find T x and T y . What are the units on these partial derivatives? 

ii. Suppose an ant is walking along the x-axis at the rate of 2 feet per minute toward 
the origin. When the ant is at the point (2, 0), what is the instantaneous rate of 
change in the temperature dT/dt that the ant experiences. Include units on your 
response. 

iii. Suppose instead that the ant walks along an ellipse with x = 6cos(f) and y = 
3 sin(f), where t is measured in minutes. Find ^ratt = 7r/6, f = 7r/4, and t = n/3. 
What does this seem to tell you about the path along which the ant is walking? 

(c) Suppose that you are walking along a surface whose elevation is given by a function 
/. Furthermore, suppose that if you consider how your location corresponds to points 
in the x-y plane, you know that when you pass the point (2,1), your velocity vector is 
v = (—1,2). If some contours of / are as shown in Figure 10.36, estimate the rate of 
change df /dt when you pass through (2,1). 



Figure 10.36: Some contours of /. 


CXI 


Tree Diagrams 

Up to this point, we have applied the Chain Rule to situations where we have a function 2 of 
variables x and y, with both x and y depending on another single quantity t. We may apply 
the Chain Rule, however, when x and y each depend on more than one quantity, or when 2 is a 
function of more than two variables. It can be challenging to keep track of all the dependencies 
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among the variables, and thus a tree diagram can be a useful tool to organize our work. For 
example, suppose that z depends on x and y, and x and y both depend on t. We may represent 
these relationships using the tree diagram shown at left Figure 10.37. We place the dependent 
variable at the top of the tree and connect it to the variables on which it depends one level below. 
We then connect each of those variables to the variable on which each depends. 


2 


x y 


t t 



Figure 10.37: A tree diagram illustrating dependencies. 

To represent the Chain Rule, we label every edge of the diagram with the appropriate deriva¬ 
tive or partial derivative, as seen at right in Figure 10.37. To calculate an overall derivative accord¬ 
ing to the Chain Rule, we construct the product of the derivatives along all paths connecting the 
variables and then add all of these products. For example, the diagram at right in 10.37 illustrates 
the Chain Rule 

dz dz dx dz dy 
dt dx dt + dy dt' 

Activity 10.15. 

(a) Figure 10.38 shows the tree diagram we construct when (a) z depends on w, x, and y, 
(b) w, x, and y each depend on u and v, and (c) u and v depend on t. 

i. Label the edges with the appropriate derivatives. 

dz 

ii. Use the Chain Rule to write —. 

dt 

(b) Suppose that z = x 2 — 2 xy 2 and that 

x = r cos(0) 
y = r sin(0). 

i. Construct a tree diagram representing the dependencies of z on x and y and x and 
y on r and 6. 

ii. Use the tree diagram to find 

iii. Now suppose that r = 3 and 6 = 7t/6. Find the values of x and y that correspond to 
these given values of r and 9, and then use the Chain Rule to find the value of the 
partial derivative |( 3 ,|)- 


@®®@ 
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t 


t 


Figure 10.38: Three levels of dependencies 
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Summary 

• The Chain Rule is a tool for differentiating a composite for functions. In its simplest form, it 
says that if f(x, y) is a function of two variables and x(t) and y(t) depend on t, then 

df df dx df dy 
dt dx dt dy dt' 

• A tree diagram can be used to represent the dependence of variables on other variables. By 
following the links in the tree diagram, we can form chains of partial derivatives or derivatives 
that can be combined to give a desired partial derivative. 


Exercises 

1. Find the indicated derivative. In each case, state the version of the Chain Rule that you are 
using. 

(a) if f(x, y) = 2 x 2 y, x = cos(£), and y = ln(i). 

( b ) my, if f(x, y ) = 2 x 2 y, x = w + z 2 , and y = 

(c) if f(x, y, z) = 2 x 2 y + z 3 , x = u — v + 2 w, y = w2 v — u 3 , and z = u 2 — v 

2. Let z = u 2 — v 2 and suppose that 


@©@© 


u = e x cos (y) 
v = e x sin(y) 
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(a) Find the values of u and v that correspond to x = 0 and y = 2tt/3. 

(b) Use the Chain Rule to find the general partial derivatives 


dz 

dx 


and 


dz 

dy 


and then determine both || 


(o,f) 


and | 


(o,f )■ 


3. Suppose that T = x 2 + y 2 — 2z where 


x = p sin(0) cos (9) 
y = psin((^)) sin(0) 
z = pcos(cj>) 


(a) Construct a tree diagram representing the dependencies among the variables. 

(b) Apply the chain rule to find the partial derivatives 

dT dT dT 

~dp'W Snd ~d9' 

4. Suppose that the temperature on a metal plate is given by the function T with 

T(x,y) = 100- (x 2 + 4y 2 ), 

where the temperature is measured in degrees Fahrenheit and x and y are each measured in 
feet. Now suppose that an ant is walking on the metal plate in such a way that it walks in a 
straight line from the point (1, 4) to the point (5, 6). 

(a) Find parametric equations (x(t),y(t)) for the ant's coordinates as it walks the line from 

(1,4) to (5,6). 

(b) What can you say about ^ and ^ for every value of t ? 

(c) Determine the instantaneous rate of change in temperature with respect to t that the ant 
is experiencing at the moment it is halfway from (1,4) to (5, 6), using your parametric 
equations for x and y. Include units on your answer. 

5. There are several proposed formulas to approximate the surface area of the human body. One 
model 1 uses the formula 

A(h,w) = 0 . 0072 / i °' 725 u >°’ 425 , 

where A is the surface area in square meters, h is the height in centimeters, and w is the weight 
in kilograms. 

3 DuBois D, DuBois DF. A formula to estimate the approximate surface area if height and weight be known. Arch Int 
Med 1916;17:863-71. 
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Since a person's height h and weight w change over time, h and w are functions of time t. 
Let us think about what is happening to a child whose height is 60 centimeters and weight is 9 
kilograms. Suppose, furthermore, that h is increasing at an instantaneous rate of 20 centimeters 
per year and w is increasing at an instantaneous rate of 5 kg per year. 

Determine the instantaneous rate at which the child's surface area is changing at this point in 
time. 

6 . Let z = f(x, y) = 50 — (x + l ) 2 — (y + 3 ) 2 and z = h(x, y) = 24 — 2x — 6 y. 

Suppose a person is walking on the surface z = f(x, y) in such a way that she walks the curve 
which is the intersection of / and h. 

(a) Show that x(t) = 4cos(f) and y(t) = 4sin(f) is a parameterization of the "shadow" in 
the x-y plane of the curve that is the intersection of the graphs of / and h. 

(b) Use the parameterization from part (a) to find the instantaneous rate at which her height 
is changing with respect to time at the instant t = 2ir/3. 

7. The voltage V (in volts) across a circuit is given by Ohm's Law: V = IR, where I is the current 
(in amps) in the circuit and R is the resistance (in ohms). Suppose we connect two resistors 
with resistances R\ and Ro in parallel as shown in Ligure 10.39. The total resistance R in the 
circuit is then given by 

1 _ i 1 

R R.\ R 2 

(a) Assume that the current, I, and the resistances, R,\ 
and f? 2 , are changing over time, t. Use the Chain 
Rule to write a formula for 

(b) Suppose that, at some particular point in time, we 
measure the current to be 3 amps and that the current 
is increasing at ^5 amps per second, while resistance 
R.\ is 2 ohms and decreasing at the rate of 0.2 ohms 
per second and R 2 is 1 ohm and increasing at the rate 
of 0.5 ohms per second. At what rate is the voltage 
changing at this point in time? 



Ligure 10.39: Resistors in parallel. 
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10.6 Directional Derivatives and the Gradient 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 

• The partial derivatives of a function / tell us the rate of change of / in the direction of the 
coordinate axes. How can we measure the rate of change of / in other directions? 

• What is the gradient of a function and what does it tell us? 


Introduction 

The partial derivatives of a function tell us the instantaneous rate at which the function changes as 
we hold all but one independent variable constant and allow the remaining independent variable 
to change. It is natural to wonder how we can measure the rate at which a function changes in 
directions other than parallel to a coordinate axes. In what follows, we investigate this question, 
and see how the rate of change in any given direction is connected to the rates of change given by 
the standard partial derivatives. 

Preview Activity 10.6. Let's consider the function / defined by 

f(x,y) = 30 — x 2 — ^y 2 , 

and suppose that / measures the temperature, in degrees Celsius, at a given point in the plane, 
where x and y are measured in feet. Assume that the positive x-axis points due east, while the 
positive //-axis points due north. A contour plot of / is shown in Figure 10.40 

6 
5 
4 
3 
2 
1 


1 2 3 4 5 6 



Figure 10.40: A contour plot of /(x, y) = 30 — x 2 — \y 2 . 
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(a) Suppose that a person is walking due east, and thus parallel to the .r-axis. At what instan¬ 
taneous rate is the temperature changing at the moment she passes the point (2,1)? What 
are the units on this rate of change? 

(b) Next, determine the instantaneous rate of change of temperature at the point (2,1) if the 
person is instead walking due north. Again, include units on your result. 

(c) Now, rather than walking due east or due north, let's suppose that the person is walking 
with velocity given by the vector v = (3, 4), where time is measured in seconds. Note that 
the person's speed is thus |v| = 5 feet per second. 

Find parametric equations for the person's path; that is, parameterize the line through 
(2,1) using the direction vector v = (3,4). Let x(t) denote the x-coordinate of the line, and 
y(t) its //-coordinate. 

(d) With the parameterization in (c), we can now view the temperature / as not only a function 

of x and y, but also of time, t. Hence, use the chain rule to determine the value of '-jjj | „. 

What are the units on your answer? What is the practical meaning of this result? 
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Directional Derivatives 

Given a function z = f(x. y ), the partial derivative f x (x o, yo) measures the instantaneous rate of 
change of / as only the x variable changes; likewise, f y {xo,yo) measures the rate of change of / at 
(xq, yo) as only y changes. Note particularly that f x (x o, yo) is measured in "units of / per unit of 
change in x," and that the units on f y (x o, yo) are similar. 

In Preview Activity 10.6, we saw how we could measure the rate of change of / in a situation 
where both x and y were changing; in that activity, however, we found that this rate of change was 
measured in "units of / per unit of time." In a given unit of time, we may move more than one 
unit of distance. In fact, in Preview Activity 10.6, in each unit increase in time we move a distance 
of |v| = 5 feet. To generalize the notion of partial derivatives to any direction of our choice, we 
instead want to have a rate of change whose units are "units of / per unit of distance in the given 
direction." 

In this light, in order to formally define the derivative in a particular direction of motion, we 
want to represent the change in / for a given unit change in the direction of motion. We can 
represent this unit change in direction with a unit vector, say u = (u\, 112 ). If we move a distance h 
in the direction of u from a fixed point (xo, yo), we then arrive at the new point (xo + u\h, yo + u-^h). 
It now follows that the slope of the secant line to the curve on the surface through (xo, yo) i n the 
direction of u through the points (xo, yo) and (x<i + u 1 h. yo + wWi) is 

f{xo + u 1 h,y 0 + u 2 h) - f(x 0 ,y 0 ) nnQ s 

msec = -;-• (10.9) 
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To get the instantaneous rate of change of / in the direction u = (txi, 7x2), we must take the 
limit of the quantity in Equation (10.9) as h —> 0. Doing so results in the formal definition of the 
directional derivative. 


Definition 10.4. Let / = f(x, y ) be given. The derivative of / at the point (x, y) in the direc¬ 
tion of the unit vector u = ( 7 x 1 , 7 x 2 ) is denoted D u f(x, y) and is given by 


D u f{x, y) = lim 
ft—70 


f(x + uih, y + u 2 h) - f(x, y) 
h 


( 10 . 10 ) 


for those values of x and y for which the limit exists. 


The quantity D u f(x,y) is called a directional derivative. When we evaluate the directional 
derivative D u f(x,y ) at a point ( xq , yo), the result D u f(xo,yo) tells us the instantaneous rate at 
which / changes at (xo, yo) per unit increase in the direction of the vector u. In addition, the quan¬ 
tity D u f(xo,yo) tells us the slope of the line tangent to the surface in the direction of u at the point 

{xo,yo,f(x 0 ,yo)). 


Computing the Directional Derivative 

In a similar way to how we developed shortcut rules for standard derivatives in single variable 
calculus, and for partial derivatives in multivariable calculus, we can also find a way to evaluate 
directional derivatives without resorting to the limit definition found in Equation (10.10). We do 
so using a very similar approach to our work in Preview Activity 10.6. 

Suppose we consider the situation where we are interested in the instantaneous rate of change 
of / at a point (xo,yo) in the direction u = (u\ . 7x2), where u is a unit vector. The variables x and y 
are therefore changing according to the parameterization 


x = xo + u\t and y = yo + 7 x 2 1. 

Observe that ^ = 7 x 1 and ^ = 7 x 2 for all values of t. Since u is a unit vector, it follows that a 
point moving along this line moves one unit of distance per one unit of time; that is, each single 
unit of time corresponds to movement of a single unit of distance in that direction. This observa¬ 
tion allows us to use the Chain Rule to calculate the directional derivative, which measures the 
instantaneous rate of change of / with respect to change in the direction u. 

In particular, by the Chain Rule, it follows that 


D u f(xo,yo) = fx(x 0 ,yo) 


dx 

dt 


(xo,yo) 


+f y (x 0 ,yo ) 


dy 

dt 


[xo,yo) 


= fx(x 0 ,yo)ui + fy(x 0,2/0) 7 X 2 . 


This now allows us to compute the directional derivative at an arbitrary point according to the 
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following formula. 

Given a differentiable function / = f(x, y) and a unit vector u = (u \, u 2 ), we may compute 
D u ffay )by 

D u f{x, y) = f x {x, y)ui + f y (x, y)u 2 . (10.11) 


Note well: To use Equation (10.11), we must have a unit vector u = (u\,u 2 ) in the direction of 
motion. In the event that we have a direction prescribed by a non-unit vector, we must first scale 
the vector to have length 1. 

Activity 10.16. 

Let f(x,y) = 3 xy-x 2 y 3 . 

(a) Determine f x (x, y) and f y fa y). 

(b) Use Equation (10.11) to determine D{f(x,y) and Djf(x,y). What familiar function is 
D\f? What familiar function is Dj/? 

(c) Use Equation (10.11) to find the derivative of / in the direction of the vector v = (2,3) 
at the point (1,-1). Remember that a unit direction vector is needed. 

<1 


The Gradient 

Via the Chain Rule, we have seen that for a given function / = f(x, y), its instantaneous rate of 
change in the direction of a unit vector u = {u\,u 2 ) is given by 

D u f(x o, 2 / 0 ) = fxfa, yo)ui + f y fa, y 0 )u 2 . (10.12) 

Recalling that the dot product of two vectors v = (vi,v 2 ) and u = (u\,u 2 ) is computed by 

V • u = V\U\ + V 2 U 2 , 

we see that we may recast Equation (10.12) in a way that has geometric meaning. In particular, we 
see that D u f(x o, yo ) is the dot product of the vector (f x fa,yo), fyfa, yo)) and the vector u. 

We call this vector formed by the partial derivatives of / the gradient of / and denote it 

V/(z 0 ,2/ 0 ) = (fxfa,yo),f y fa,yo)) ■ 

We read V/ as "the gradient of /," "grad /" or "del /". 4 Notice that V/ varies from point to point. 
In the following activity, we investigate some of what the gradient tells us about the behavior of a 
function /. 


4 The symbol V is called nabla, which comes from a Greek word for a certain type of harp that has a similar shape. 
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Activity 10.17. 

Let's consider the function / defined by f(x,y) = x 2 — y 2 . Some contours for this function are 
shown in Figure 10.41. 



Figure 10.41: Contours of f(x, y) = x 2 — y 2 . 


(a) Find the gradient V/(x, y). 

(b) For each of the following points (xo,yo), evaluate the gradient V/(xo, yo) and sketch 
the gradient vector with its tail at (xo, yo)- Some of the vectors are too long to fit onto 
the plot, but we'd like to draw them to scale; to do so, scale each vector by a factor of 
1/4. 

• Oo,yo) = (2,0) 

• Oh),yo) = (0,2) 

• (xo,yo) = (2,2) 

• (xo,yo) = (2,1) 

• (xo,yo) = (-3,2) 

• (xo,yo) = (-2,-4) 

• (xo,yo) = (0,0) 

(c) What do you notice about the relationship between the gradient at (xo, yo) and the con¬ 
tour line passing through that point? 

(d) Does / increase or decrease in the direction of V/(xo,yo)? Provide a justification for 
your response. 


<1 

As a vector, V/(xo,yo) defines a direction and a length. As we will soon see, both of these 
convey important information about the behavior of / near (xq, yo)- 
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The Direction of the Gradient 

Remember that the dot product also conveys information about the angle between the two vectors. 
If 0 is the angle between V/(xo, yo) and u (where u is a unit vector), then we also have that 

D u f{xo,yo) = V/(x 0 ,2/o) ■ u = |V/(x 0 ,y 0 )||u| cos(0). 


In particular, when 6 is a right angle, as shown on the left of Figure 10.42, then D u f(x o, yo) = 
0, because cos(0) = 0. Since the value of the directional derivative is 0, this means that / is 
unchanging in this direction, and hence u must be tangent to the contour of / that passes through 
(xo, yo)- In other words, Vf(xo,yo) is orthogonal to the contour through (xo, yo)- This shows that 
the gradient vector at a given point is always perpendicular to the contour passing through the 
point, confirming that what we saw in part (c) of Activity 10.17 holds in general. 



Figure 10.42: The sign of D u f(x o, yo) is determined by 9. 

Moreover, when 9 is an acute angle, it follows that cos(0) > 0 so since 

D u f(xo,yo ) = |V/(x 0 ,y 0 )||u| cos(0), 

and therefore D u f(x o, yo) > 0, as shown in the middle image in Figure 10.42. This means that / 
is increasing in any direction where 0 is acute. In a similar way, when 0 is an obtuse angle, then 
cos(0) < 0 so D u f(x o, yo) < 0, as seen on the right in Figure 10.42. This means that / is decreasing 
in any direction for which 0 is obtuse. 

Finally, as we can see in the following activity, we may also use the gradient to determine the 
directions in which the function is increasing and decreasing most rapidly. 

Activity 10.18. 

In this activity we investigate how the gradient is related to the directions of greatest increase 
and decrease of a function. Let / be a differentiable function and u a unit vector. 

(a) Let 0 be the angle between V/(xq, yo) and u. Explain why 
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D u f(x 0 ,yo) = \ {fx(xo,yo),fy(xo,yo))\cos(9). 


(10.13) 
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(b) At the point (xq, yo), the only quantity in Equation (10.13) that can change is 9 (which 
determines the direction u of travel). Explain why 9 = 0 makes the quantity 


\(fx(xo,yo),fy(xo,yo))\cos(9) 


as large as possible. 

(c) When 9 = 0, in what direction does the unit vector u point relative to V/(.xq. yo)? Why? 
What does this tell us about the direction of greatest increase of / at the point (.xq, yo)? 

(d) In what direction, relative to V/(xq. yo), does / decrease most rapidly at the point 

Oo,2/o)? 

(e) State the unit vectors u and v (in terms of V f(xo,yo)) that provide the directions of 
greatest increase and decrease for the function / at the point (xo, yo)- What important 
assumption must we make regarding Vf(xo,yo) in order for these vectors to exist? 

<1 

The Length of the Gradient 

Having established in Activity 10.18 that the direction in which a function increases most rapidly 
at a point (xo, yo) is the unit vector u in the direction of the gradient, (that is, u = vf(i» :w )| V/(.xo, yo), 
provided that Vf(xo, yo) A 0), it is also natural to ask, "in the direction of greatest increase for / 
at (xo, yo), what is the value of the rate of increase?" In this situation, we are asking for the value 

of D u f(x 0 , yo) where u = |v/(i 0 ,y 0 )| V /( x °’ Vo)- 

Using the now familiar way to compute the directional derivative, we see that 



Next, we recall two important facts about the dot product: (i) w • (cv) = c(w • v) for any scalar 
c, and (ii) w ■ w = |w| 2 . Applying these properties to the most recent equation involving the 
directional derivative, we find that 



Finally, since V f(xo, yo) is a nonzero vector, its length V/(xq. yo) is a nonzero scalar, and thus we 
can simplify the preceding equation to establish that 

D u f(x 0 ,yo) = |V/(x 0 , yo)|- 

We summarize our most recent work by stating two important facts about the gradient. 

Let / be a differentiable function and (xo, yo) a point for which V/(xq. yo) A 0. Then V/(.x'o- yo) 
points in the direction of greatest increase of / at (xo, yo), and the instantaneous rate of change 
of / in that direction is the length of the gradient vector. That is, if u = |vf(i 0 yo )\ W(To> Vo), 
then u is a unit vector in the direction of greatest increase of / at (xo, yo), and D u f(x o, yo) = 

|V/(x 0 ,y 0 )|- 
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Activity 10.19. 

Consider the function / defined by f(x, y) = 2x 2 — xy + 2 y. 

(a) Find the gradient V/(l, 2) and sketch it on Figure 10.43. 

4 

3 

2 

1 


12 3 4 

Figure 10.43: A plot for the gradient V/( 1,2). 

(b) Sketch the unit vector z = ^ —-C,—on Figure 10.43 with its tail at (1,2). Now find 
the directional derivative D z f( 1,2). 

(c) What is the slope of the graph of / in the direction z? What does the sign of the direc¬ 
tional derivative tell you? 

(d) Consider the vector v = (2, —1) and sketch v on Figure 10.43 with its tail at (1, 2). Find 
a unit vector w pointing in the same direction of v. Without computing D w /(1, 2), what 
do you know about the sign of this directional derivative? Now verify your observation 
by computing D w /(1, 2). 

(e) In which direction (that is, for what unit vector u) is /!„/(!. 2) the greatest? What is the 
slope of the graph in this direction? 

(f) Corresponding, in which direction is D u /(1, 2) least? What is the slope of the graph in 
this direction? 

(g) Sketch two unit vectors u for which D u /(1, 2) = 0 and then find component represen¬ 
tations of these vectors. 

(h) Suppose you are standing at the point (3, 3). In which direction should you move to 
cause / to increase as rapidly as possible? At what rate does / increase in this direction? 


y 















X 
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Applications 

The gradient finds many natural applications. For example, situations often arise - for instance, 
constructing a road through the mountains or planning the flow of water across a landscape - 
where we are interested in knowing the direction in which a function is increasing or decreasing 
most rapidly. 

For example, consider a two-dimensional version of how a heat-seeking missile might work. 5 
Suppose that the temperature surrounding a fighter jet can be modeled by the function T defined 
by 

T( 1 - 100 

[X,y) 1 + (x - 5) 2 + %- 2.5) 2 ’ 

where (x, y) is a point in the plane of the fighter jet and T(x, y) is measured in degrees Celsius. 
Some contours and gradients VT are shown on the left in Figure 10.44. 
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Figure 10.44: Contours and gradient for T(x, y) and the missile's path. 

A heat-seeking missile will always travel in the direction in which the temperature increases 
most rapidly; that is, it will always travel in the direction of the gradient VT. If a missile is fired 
from the point (2,4), then its path will be that shown on the right in Figure 10.44. 

In the final activity of this section, we consider several questions related to this context of a 
heat-seeking missile, and foreshadow some upcoming work in Section 10.7. 

Activity 10.20. 

(a) The temperature T(x,y) has its maximum value at the fighter jet's location. State the 
fighter jet's location and explain how Figure 10.44 tells you this. 

(b) Determine VT at the fighter jet's location and give a justification for your response. 

(c) Suppose that a different function / has a local maximum value at (xo, ?/(>)■ Sketch the 
behavior of some possible contours near this point. What is V/(xo, yo)? 

(d) Suppose that a function g has a local minimum value at (xq, yg). Sketch the behavior of 
some possible contours near this point. What is Vg(xg, yo)? 


3 This application is borrowed from United States Air Force Academy Department of Mathematical Sciences at 

http : //www . nku.edu/~longa/classes/mat320/mathematica/multealc . htm. 


































156 


10.6. DIRECTIONAL DERIVATIVES AND THE GRADIENT 


(e) If a function g has a local minimum at (.To, Vo), what is the direction of greatest increase 

of g at (x 0 ,y 0 )? 




Summary 

• The directional derivative of / at the point (x, y) in the direction of the unit vector u = (ui. U 9 ) 

D„/(x, y) = lim /(I + V - + k " 2 '* ) ~ /( *’ y) 

h — h 

for those values of x and y for which the limit exists. In addition, D u f(x. y) measures the slope 
of the graph of / when we move in the direction u. Alternatively, D u /(xo,yo) measures the 
instantaneous rate of change of / in the direction u at (xo, IJo) ■ 

• The gradient of a function f = f(x,y) at a point (xo, yo) is the vector 

V/(x 0 ,yo) = (fx(xo,yo),fy(xo,yo))- 

• The directional derivative in the direction u may be computed by 

D u f(x 0 , y 0 ) = V/(x 0 , yo) • u. 

• At any point where the gradient is nonzero, gradient is orthogonal to the contour through that 
point and points in the direction in which / increases most rapidly; moreover, the slope of / 
in this direction equals the length of the gradient |V/(xo,yo)|- Similarly, the opposite of the 
gradient points in the direction of greatest decrease, and that rate of decrease is the opposite of 
the length of the gradient. 


Exercises 

1. Let E(x,y) = --- represent the elevation on a land mass at location 

1 + (x - 5) 2 + 4(y - 2.5) 2 r 

(x, y). Suppose that E, x, and y are all measured in meters. 

(a) Find E x (x, y) and E y (x, y). 

(b) Let u be a unit vector in the direction of (—4,3). Determine D U E( 3,4). What is the 
practical meaning of D U E( 3,4) and what are its units? 

(c) Find the direction of greatest increase in E at the point (3,4). 

(d) Find the instantaneous rate of change of E in the direction of greatest decrease at the 
point (3,4). Include units on your answer. 

(e) At the point (3,4), find a direction w in which the instantaneous rate of change of E is 

0 . 

2. Let f(x, y) = x 2 + 3 y 2 . 
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(a) Find Vf(x,y) and V/(l,2). 

(b) Find the direction of greatest increase in / at the point (1,2). Explain. A graph of the 
surface defined by / is shown in Figure 10.45. Illustrate this direction on the surface. 

(c) A contour diagram of / is shown in Figure 10.46. Illustrate your calculation from (b) on 
this contour diagram. 



Figure 10.45: The surface for 

f(x,y) = x 2 + 3 y 2 . 



Figure 10.46: Contours for f(x,y) = 
x 2 + 3 y 2 . 


(d) Find a direction w for which the slope of the tangent line to the surface generated by / 
at the point (1, 2) is zero in the direction w. 

3. The properties of the gradient that we have observed for functions of two variables also hold 
for functions of more variables. In this problem, we consider a situation where there are three 
independent variables. Suppose that the temperature in a region of space is described by 

T(x,y,z) = lOOe-* 2 -*' 2 -* 2 

and that you are standing at the point (1, 2, —1). 

(a) Find the instantaneous rate of change of the temperature in the direction of v = (0,1, 2) 
at the point (1, 2, —1). Remember that you should first find a unit vector in the direction 
of v. 

(b) In what direction from the point (1, 2, —1) would you move to cause the temperature to 
decrease as quickly as possible? 

(c) How fast does the temperature decrease in this direction? 

(d) Find a direction in which the temperature does not change at (1, 2, —1). 
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Figure 10.47: The gradient V/. 


4. Figure 10.47 shows a plot of the gradient V/ at several points for some function / = f(x, y ). 

(a) Consider each of the three indicated points, and draw, as best as you can, the contour 
through that point. 

(b) Beginning at each point, draw a curve on which / is continually decreasing. 
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10.7 Optimization 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• How can we find the points at which f(x, y) has a local maximum or minimum? 

• How can we determine whether critical points of f(x, y) are local maxima or minima? 

• How can we find the absolute maximum and minimum of f(x, y) on a closed and bounded 
domain? 


Introduction 

We learn in single-variable calculus that the derivative is a useful tool for finding the local maxima 
and minima of functions, and that these ideas may often be employed in applied settings. In 
particular, if a function /, such as the one shown in Figure 10.48 is everywhere differentiable, we 
know that the tangent line is horizontal at any point where / has a local maximum or minimum. 
This, of course, means that the derivative f is zero at any such point. Hence, one way that we 
seek extreme values of a given function is to first find where the derivative of the function is zero. 



Figure 10.48: The graph of y = f(x). 

In multivariable calculus, we are often similarly interested in finding the greatest and/or least 
value(s) that a function may achieve. Moreover, there are many applied settings in which a quan¬ 
tity of interest depends on several different variables. In the following preview activity, we begin 
to see how some key ideas in multivariable calculus can help us answer such questions by thinking 
about the geometry of the surface generated by a function of two variables. 

Preview Activity 10.7. Let z = f(x, y) be a differentiable function, and suppose that at the point 
( x o. ?/o), / achieves a local maximum. That is, the value of f(xo,yo ) is greater than the value 
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of f(x,y) for all (x. y) nearby (xq, yo). You might find it helpful to sketch a rough picture of a 
possible function / that has this property. 

(a) If we consider the trace given by holding y = yo constant, then the single-variable function 
defined by f(x, yo) must have a local maximum at xq. What does this say about the value 
of the partial derivative f x {xo, yo)? 

(b) In the same way, the trace given by holding x = xo constant has a local maximum at y = yo- 
What does this say about the value of the partial derivative f y (x o, yo)? 

(c) What may we now conclude about the gradient V/(xo, yo) at the local maximum? How is 
this consistent with the statement “ f increases most rapidly in the direction V/(xo, yo)?” 

(d) How will the tangent plane to the surface z = /(x , y) appear at the point (xo, yo, /(xo, yo))? 

(e) By first computing the partial derivatives, find any points at which /(x, y) = 2x — x 2 — (y + 
2) 2 may have a local maximum. 




Extrema and Critical Points 

One of the important applications of single-variable calculus is the use of derivatives to identify 
local extremes of functions (that is, local maxima and local minima). Using the tools we have 
developed so far, we can naturally extend the concept of local maxima and minima to several- 
variable functions. 


Definition 10.5. Let / be a function of two variables x and y. 

• The function / has a local maximum at a point (xo, yo) provided that /(x, y) < f(x o, yo) 
for all points (x, y) near (xo, yo)- In this situation we say that /(xo, yo) is a local maximum 
value. 

• The function / has a local minimum at a point (xo, yo) provided that /(x, y) > f(x o, yo) 
for all points (x, y) near (xo, yo)- In this situation we say that /(xo, yo) is a local minimum 
value. 

• An absolute maximum point is a point (xo, yo) for which /(x, y) < /(xo, yo) for all points 
(x, y) in the domain of /. The value of / at an absolute maximum point is the maximum 
value of /. 

• An absolute minimum point is a point such that /(x, y) > /(xo, yo) for all points (x, y) 
in the domain of /. The value of / at an absolute minimum point is the maximum value 

of/. 
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We use the term extremum point to refer to any point (xo, yo ) at which / has a local maximum 
or minimum. In addition, the function value /(x o, yo ) at an extremum is called an extremal value. 
Figure 10.49 illustrates the graphs of two functions that have an absolute maximum and minimum, 
respectively, at the origin (xo, yo) = (0,0). 




Figure 10.49: An absolute maximum and an absolute minimum 

In single-variable calculus, we saw that the extrema of a continuous function / always occur 
at critical points, values of x where / fails to be differentiable or where f'(x) = 0. Said differently, 
critical points provide the locations where extrema of a function may appear. Our work in Preview 
Activity 10.7 suggests that something similar happens with two-variable functions. 

Suppose that a continuous function / has an extremum at (xq. yo)- In this case, the trace 
/(x, yo) has an extremum at xo, which means that xo is a critical value of /(x, yo). Therefore, either 
fx(x o, y 0 ) does not exist or f x (x 0 , yo) = 0. Similarly, either f y (x 0 , yo) does not exist or f y (x 0 , y 0 ) = 0. 
This implies that the extrema of a two-variable function occur at points that satisfy the following 
definition. 


Definition 10.6. A critical point (xo, yo) of a function / = /(x, y) is a point in the domain of 
/ at which f x (x 0 , yo) = 0 and f y (x 0 , yo) = 0, or such that one of f x (x 0 ,yo) or f y (x 0 , y 0 ) fails to 
exist. 


We can therefore find critical points of a function / by computing partial derivatives and iden¬ 
tifying any values of (x, y) for which one of the partials doesn't exist or for which both partial 
derivatives are simultaneously zero. For the latter, note that we have to solve the system of equa¬ 
tions 


fx(x,y) = 0 
f y {x,y) = 0. 
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Activity 10.21. 

Find the critical points of each of the following functions. Then, using appropriate technology 
(e.g., WolframjAlpha or CalcPlot3D 6 ), plot the graphs of the surfaces near each critical value 
and compare the graph to your work. 

(a) f{x,y) = 2 + x 2 + y 2 

(b) f{x,y) = 2 + x 2 -y 2 

(c) f(x,y) = 2x - x 2 - \y 2 

(d) f(x,y) = \x\ + \y\ 

(e) f(x, y) = 2 xy - 4x + 2y - 3. 
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Classifying Critical Points: The Second Derivative Test 

While the extrema of a continuous function / always occur at critical points, it is important to 
note that not every critical point leads to an extremum. Recall, for instance, f(x) = x 3 from single 
variable calculus. We know that xo = 0 is a critical point since f'(x o) = 0, but xq = 0 is neither a 
local maximum nor a local minimum of /. 

A similar situation may arise in a multivariable setting. Consider the function / defined by 
f(x,y) = x 2 — y 2 whose graph and contour plot are shown in Figure 10.50. Because V/ = 
(2x, —2 y), we see that the origin (xo, yo) = (0,0) is a critical point. However, this critical point 
is neither a local maximum or minimum; the origin is a local minimum on the trace defined by 
y = 0, while the origin is a local maximum on the trace defined by x = 0. We call such a critical 
point a saddle point due to the shape of the graph near the critical point. 




Figure 10.50: A saddle point. 


6 at http : //web.monroecc . edu/manila/webfiles/calcNSF/JavaCode/CalcPlot3D . htm 
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As in single-variable calculus, we would like to have some sort of test to help us identify 
whether a critical point is a local maximum, local maximum, or neither. Before describing the test 
that accomplishes this classification in a two-variable setting, we recall the Second Derivative Test 
from single-variable calculus. 

The Second Derivative Test for single-variable functions. If xq is a critical point of a function 
/ so that f'{x o) = 0 and if f"(x o) exists, then 

• if f"(x o) < 0, xo is a local maximum, 

• if f"(x o) > 0, xq is a local minimum, and 

• if f"(x o) = 0, this test yields no information. 


For the analogous test for functions of two variables, we have to consider all of the second- 
order partial derivatives. 

The Second Derivative Test for two-variable functions. Suppose (xo, yo) is a critical point of 
the function / for which f x (x o, yo) = 0 and f y (xq , yo) = 0. Let D be the quantity defined by 

D = fxx(xo,yo)fyy(xo,yo) fxy[xo,yo) ■ 

• If D > 0 and f xx (x o, yo) < 0, then / has a local maximum at (xo, yo). 

• If D > 0 and f xx (xo, yo) >0, then / has a local minimum at (xo, yo)- 

• If D < 0, then / has a saddle point at (xo, yo)- 

• If D = 0, then this test yields no information about what happens at (xo, yo)- 
The quantity D is called the discriminant of the function / at (xq, yo)- 


To properly understand the origin of the Second Derivative Test, we could introduce a "second- 
order directional derivative." If this second-order directional derivative were negative in every 
direction, for instance, we could guarantee that the critical point is a local maximum. A complete 
justification of the Second Derivative Test requires key ideas from linear algebra that are beyond 
the scope of this course, so instead of presenting a detailed explanation, we will accept this test as 
stated and demonstrate its use in three basic examples. 
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Example 10.1. Let f(x, y ) = 4 — x 2 — y 2 as shown in 
Figure 10.51. Critical points occur when f x = —2.x = 0 
and f y = —2 y = 0 so the origin (xo,yo) = (0,0) is the 
only critical point. We then find that 

fxx{ o, o) = -2, f yy ( 0, 0) = -2, and f xy ( 0,0) = 0, 

giving D = (—2) (—2) — 0 2 = 4 > 0. We then con¬ 
sider /ex( 0,0) = —2 < 0 and conclude, from the Sec¬ 
ond Derivative Test, that /(0,0) = 4 is a local maximum 
value. 

Example 10.2. Let /(x, y) = x 2 + y 2 as shown in Figure 

10.52. Critical points occur when f x = 2x = 0 and f y = 
2y = 0 so the origin (xo, yo) = (0, 0) is the only critical 
point. We then find that 

fxx{ o, o) = 2, fyy{ 0, 0) = 2, and f xy { 0, 0) = 0, 

giving D = 2-2 — 0 2 = 4>0. We then consider 
f xx { 0,0) = 2 > 0 and conclude, from the Second Deriva¬ 
tive Test, that /(0, 0) = 0 is a local minimum value. 

Example 10.3. Let /(x, y) = x 2 — y 2 as shown in Figure 

10.53. Critical points occur when f x = 2x = 0 and f y = 
—2y = 0 so the origin (.xq, yo) = (0, 0) is the only critical 
point. We then find that 

f xx { 0, 0) = 2, f yy ( 0,0) = -2, and f xy ( 0,0) = 0, 

giving D = 2 ■ (—2) — 0 2 = —4 < 0. We then conclude, 
from the Second Derivative Test, that (0, 0) is a saddle 
point. 



Figure 10.51: z = 4 — x 2 — y 2 



Figure 10.52: z = x 2 + y 2 



Figure 10.53: z = x 2 - y 2 


Activity 10.22. 

Find the critical points of the following functions and use the Second Derivative Test to classify 
the critical points. 

(a) f(x, y) = 3x 3 + y 2 - 9x + 4y 

(b) f(x,y) =xy + l + ^ 

(c) f(x,y) = x 3 + y 3 — 3xy. 

<1 

As we learned in single-variable calculus, finding extremal values of functions can be partic¬ 
ularly useful in applied settings. For instance, we can often use calculus to determine the least 
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expensive way to construct something or to find the most efficient route between two locations. 
The same possibility holds in settings with two or more variables. 

Activity 10.23. 

While the quantity of a product demanded by consumers is often a function of the price of 
the product, the demand for a product may also depend on the price of other products. For 
instance, the demand for blue jeans at Old Navy may be affected not only by the price of the 
jeans themselves, but also by the price of khakis. 

Suppose we have two goods whose respective prices are p\ and P 2 - The demand for these 
goods, q\ and q->, depend on the prices as 


q\ = 150 - 2 Pl - p 2 (10.14) 

qo = 200 — pi — 3p2 ■ (10.15) 

The seller would like to set the prices p\ and p-> in order to maximize revenue. We will assume 
that the seller meets the full demand for each product. Thus, if we let R be the revenue obtained 
by selling q\ items of the first good at price p\ per item and (p) items of the second good at price 
P 2 per item, we have 

R = piqi +P2<?2- 

We can then write the revenue as a function of just the two variables p\ and p 2 by using Equa¬ 
tions (10.14) and (10.15), giving us 

R(pi,P'2) = Pi(150 - 2pi - p 2 ) +P2(200 - Pi - 3p2) = 150pi + 200^2 - 2pip2 - 2p\ - 3^2- 
A graph of R as a function of p\ and pi is shown in Figure 10.54. 



Figure 10.54: A revenue function. 


(a) Find all critical points of the revenue function, R. 
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(b) Apply the Second Derivative Test to determine the type of any critical points. 

(c) Where should the seller set the prices pi and [>2 to maximize the revenue? 


<1 


Optimization on a Restricted Domain 

The Second Derivative Test helps us classify critical points of a function, but it does not tell us 
if the function actually has an absolute maximum or minimum at each such point. For single¬ 
variable functions, the Extreme Value Theorem told us that a continuous function on a closed 
interval [o, b] always has both an absolute maximum and minimum on that interval, and that these 
absolute extremes must occur at either an endpoint or at a critical point. Thus, to find the absolute 
maximum and minimum, we determine the critical points in the interval and then evaluate the 
function at these critical point s and at the endpoints of the interval. A similar approach works for 
functions of two variables. 

For functions of two variables, closed and bounded regions play the role that closed intervals 
did for functions of a single variable. A closed region is a region that contains its boundary (the 
unit disk x 2 + y 2 < 1 is closed, while its interior x 2 + y 2 < 1 is not, for example), while a bounded 
region is one that does not stretch to infinity in any direction. Just as for functions of a single 
variable, continuous functions of several variables that are defined on closed, bounded regions 
must have absolute maxima and minima in those regions. 

The Extreme Value Theorem. Let / = f(x, y) be a continuous function on a closed and 

bounded region R. Then / has an absolute maximum and an absolute minimum in R. 


The absolute extremes must occur at either a critical point in the interior of R or at a boundary 
point of R. We therefore must test both possibilities, as we demonstrate in the following example. 

Example 10.4. Suppose the temperature T at each point on the circular plate x 2 + y 2 < 1 is given 
by 

T(x,y) = 2 x 2 + y 2 - y. 

The domain R = {(x,y) : x 2 + y 2 < 1} is a closed and bounded region, as shown on the left 
of Figure 10.55, so the Extreme Value Theorem assures us that T has an absolute maximum and 
minimum on the plate. The graph of T over its domain R is shown in Figure 10.55. We will find 
the hottest and coldest points on the plate. 

If the absolute maximum or minimum occurs inside the disk, it will be at a critical point so we 
begin by looking for critical points inside the disk. To do this, notice that critical points are given 
by the conditions T x = 4.x = 0 and T y = 2y — 1 = 0. This means that there is one critical point of 
the function at the point (xo, yo) = (0,1/2), which lies inside the disk. 

We now find the hottest and coldest points on the boundary of the disk, which is the circle of 
radius 1. As we have seen, the points on the unit circle can be parametrized as 

x(t) = cos(f), y(t) = sin(t), 
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where 0 < t < 2tt. The temperature at a point on the circle is then described by 

T(x(t),y(t )) = 2 cos 2 (t) + sin 2 (i) — sin(f). 

To find the hottest and coldest point on the boundary, we look for the critical points of this 
single-variable function on the interval 0 <t< 2tt. We have 

dT 

— = —4 cos(t) sin(f) + 2 cos(t) sin (t) — cos(t) = —2 cos(f) sin(t) — cos (t) = cos(t) (—2 sin(f) — 1) = 0. 

This shows that we have critical points when cos(f) = 0 or sin(t) = —1/2. This occurs when 
t = 7t/2, 37t/2, 7ir/6, and llm/6. Since we have x(t) = cos(f) and y(t ) = sin(f), the corresponding 
points are 


• (x, y) = (0,1) when t = f, • (x, y) = when t = ' I f, 

• (x,y) = (0,-1) when t = • (x, y) = when t = 

These are the critical points of T on the boundary and so this collection of points includes the 
hottest and coldest points on the boundary. 

We now have a list of candidates for the hottest and coldest points: the critical point in the 
interior of the disk and the critical points on the boundary. We find the hottest and coldest points 
by evaluating the temperature at each of these points, and find that 

• T (°w)=-i • T (0,1) = 0, . T (0,-1) = 2, 

m j 1 (_V _1 __ 9 m t ( —I'l _2 

V 2 ’ 2/ 4' V 2 ’ 2/ I' 
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So the maximum value of T on the disk x 2 + y 2 < 1 is |, which occurs at the two points 
— 77 j on the boundary, and the minimum value of T on the disk is — \ which occurs at the 

critical point (0, in the interior of R. 

From this example, we see that we use the following procedure for determining the absolute 

maximum and absolute minimum of a function on a closed and bounded domain. 

Step 1: Find all critical points of the function in the interior of the domain. 

Step 2: Find all the critical points of the function on the boundary of the domain. Working on 
the boundary of the domain reduces this part of the problem to one or more single variable 
optimization problems. 

Step 3: Evaluate the function at each of the points found in Steps 1 and 2. 

Step 4: The maximum value of the function is the largest value obtained in Step 3, and the mini¬ 
mum value of the function is the smallest value obtained in Step 3. 

Activity 10.24. 

Let f(x, y ) = x 2 — 3 y 2 — Ax + 6 y with triangular domain R whose vertices are at (0,0), (4, 0), 
and (0,4). The domain R and a graph of / on the domain appear in Figure 10.56. 




Figure 10.56: The domain of f(x, y) = x 2 — 3 y 2 — 4x + 6 y and its graph. 

(a) Find all of the critical points of / in R. 

(b) Parameterize the horizontal leg of the triangular domain, and find the critical points of 
/ on that leg. 

(c) Parameterize the vertical leg of the triangular domain, and find the critical points of / 
on that leg. 

(d) Parameterize the hypotenuse of the triangular domain, and find the critical points of / 
on the hypotenuse. 
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(e) Find the absolute maximum and absolute minimum value of / on R. 








170 


10.7. OPTIMIZATION 


Summary 

• To find the extrema of a function / = f(x, y), we first find the critical points, which are points 
where one of the partials of / fails to exist, or where f x = 0 and f y = 0 . 

• The Second Derivative Test helps determine whether a critical point is a local maximum, local 
minimum, or saddle point. 

• If / is defined on a closed and bounded domain, we find the absolute maxima and minima 
by finding the critical points in the interior of the domain, finding the critical points on the 
boundary, and testing the value of / at both sets of critical points. 


Exercises 

1. Respond to each of the following prompts to solve the given optimization problem. 

(a) Let f(x. y) = sin (A) + cos (y). Determine the absolute maximum and minimum values 
of /. At what points do these extreme values occur? 

(b) For a certain differentiable function F of two variables x and y, its partial derivatives 
are 

F x (x, y) = x 2 - y - 4 and F y (x, y) = -x + y - 2. 

Find each of the critical points of F, and classify each as a local maximum, local mini¬ 
mum, or a saddle point. 

(c) Determine all critical points of T(x, y) = 48 + 3 xy — x 2 y — xy 2 and classify each as a 
local maximum, local minimum, or saddle point. 

(d) Find and classify all critical points of g(x, y) = ^ + 3 y 3 + 9 y 2 — 3 xy + 9y — 9x 

(e) Find and classify all critical points of z = f(x, y) = ye~ x2 ^ 2y2 . 

(f) Determine the absolute maximum and absolute minimum of f(x, y) = 2 + 2.x + 2y — 
x 2 — y 2 on the triangular plate in the first quadrant bounded by the lines x = 0 , y = 0 , 
and y = 9 — x. 

(g) Determine the absolute maximum and absolute minimum of f(x, y) = 2 + 2x + 2 y — 
x 2 — y 2 over the closed disk of points (x, y) such that (x — l ) 2 + (y — l ) 2 < 1 . 

(h) Find the point on the plane z = 6 — 3x — 2y that lies closest to the origin. 

2. If a continuous function / of a single variable has two critical numbers c\ and C 2 at which 
/ has relative maximum values, then / must have another critical number C 3 , because "it is 
impossible to have two mountains without some sort of valley in between. The other critical 
point can be a saddle point (a pass between the mountains) or a local minimum (a true valley )." 7 

7 From Calculus in Vector Spaces by Lawrence J. Corwin and Robert H. Szczarb. 
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Consider the function / defined by f(x, y) = Ax 2 e y — 2x 4 — e 4y . 8 Show that / has exactly two 
critical points of /, and that / has relative maximum values at each of these critical points. 
Explain how function illustrates that it really is possible to have two mountains without some 
sort of valley in between. Use appropriate technology to draw the surface defined by / to see 
graphically how this happens. 

3. If a continuous function / of a single variable has exactly one critical number with a relative 
maximum at that critical point, then the value of / at that critical point is an absolute maxi¬ 
mum. In this exercise we see that the same is not always true for functions of two variables. 
Let f(x, y) = 3xe y — x 3 — e 3y9 Show that / has exactly one critical point, has a relative maxi¬ 
mum value at that critical point, but that / has no absolute maximum value. Use appropriate 
technology to draw the surface defined by / to see graphically how this happens. 

4. A manufacturer wants to procure rectangular boxes to ship its product. The boxes must contain 
20 cubic feet of space. To be durable enough to ensure the safety of the product, the material for 
the sides of the boxes will cost $0.10 per square foot, while the material for the top and bottom 
will cost $0.25 per square foot. In this activity we will help the manufacturer determine the box 
of minimal cost. 

(a) What quantities are constant in this problem? What are the variables in this problem? 
Provide appropriate variable labels. What, if any, restrictions are there on the variables? 

(b) Using your variables from (a), determine a formula for the total cost C of a box. 

(c) Your formula in part (b) might be in terms of three variables. If so, find a relationship 
between the variables, and then use this relationship to write C as a function of only 
two independent variables. 

(d) Find the dimensions that minimize the cost of a box. Be sure to verify that you have a 
minimum cost. 

5. A rectangular box with length x, width y, and height z is being built. The box is positioned 
so that one corner is stationed at the origin and the box lies in the first octant where x, y, 
and z are all positive. There is an added constraint on how the box is constructed: it must fit 
underneath the plane with equation x + 2y + 3z = 6 . In fact, we will assume that the corner of 
the box "opposite" the origin must actually lie on this plane. The basic problem is to find the 
maximum volume of the box. 

(a) Sketch the plane x + 2y + 3z = 6 , as well as a picture of a potential box. Label everything 
appropriately. 

(b) Explain how you can use the fact that one corner of the box lies on the plane to write 
the volume of the box as a function of x and y only. Do so, and clearly show the formula 
you find for V(x,y). 

8 From Ira Rosenholz in the Problems Section of the Mathematics Magazine, Vol. 60 NO. 1, February 1987. 

9 From "'The Only Critical Point in Town" Test' by Ira Rosenholz and Lowell Smylie in the Mathematics Magazine, 

VOL 58 NO 3 May 1985. 
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Find all critical points of V. (Note that when finding the critical points, it is essential 
that you factor first to make the algebra easier.) 

Without considering the current applied nature of the function V, classify each critical 
point you found above as a local maximum, local minimum, or saddle point of V. 

Determine the maximum volume of the box, justifying your answer completely with an 
appropriate discussion of the critical points of the function. 

Now suppose that we instead stipulated that, while the vertex of the box opposite the 
origin still had to lie on the plane, we were only going to permit the sides of the box, x 
and y, to have values in a specified range (given below). That is, we now want to find 
the maximum value of V on the closed, bounded region 

\<x<\, l<y<2. 

Find the maximum volume of the box under this condition, justifying your answer fully. 

6 . The airlines place restrictions on luggage that can be carried onto planes. 

• A carry-on bag can weigh no more than 40 lbs. 

• The length plus width plus height of a bag cannot exceed 45 inches. 

• The bag must fit in an overhead bin. 

Let x, y, and z be the length, width, and height (in inches) of a carry on bag. In this problem we 
find the dimensions of the bag of largest volume, V = xyz, that satisfies the second restriction. 
Assume that we use all 45 inches to get a maximum volume. (Note that this bag of maximum 
volume might not satisfy the third restriction.) 

(a) Write the volume V = V(x, y) as a function of just the two variables x and y. 

(b) Explain why the domain over which V is defined is the triangular region It with vertices 
(0,0), (45,0), and (0,45). 

(c) Find the critical points, if any, of V in the interior of the region R. 

(d) Find the maximum value of V on the boundary of the region R, and the determine the 
dimensions of a bag with maximum volume on the entire region R. (Note that most 
carry-on bags sold today measure 22 by 14 by 9 inches with a volume of 2772 cubic 
inches, so that the bags will fit into the overhead bins.) 

7. According to The Song of Insects by G.W. Pierce (Harvard College Press, 1948) the sound of 
striped ground crickets chirping, in number of chirps per second, is related to the temperature. 
So the number of chirps per second could be a predictor of temperature. The data Pierce col¬ 
lected is shown in the table below, where x is the (average) number of chirps per second and y 
is the temperature in degrees Fahrenheit. 


(c) 

(d) 

(e) 

(f) 
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X 

y 

20.0 

88.6 

16.0 

71.6 

19.8 

93.3 

18.4 

84.3 

17.1 

80.6 

15.5 

75.2 

14.7 

69.7 

17.1 

82.0 

15.4 

69.4 

16.2 

83.3 

15.0 

79.6 

17.2 

82.6 

16.0 

80.6 

17.0 

83.5 

14.4 

76.3 


A scatter plot of the data is given below. The relationship between x and y is not exactly 
linear, but looks to have a linear pattern. It could be that the relationship is really linear but 
experimental error causes the data to be slightly inaccurate. Or perhaps the data is not linear, 
but only approximately linear. 


If we want to use the data to make predications, then we need to fit a curve of some kind to 
the data. Since the cricket data appears roughly linear, we will fit a linear function / of the 
form f(x) = rrix + b to the data. We will do this in such a way that we minimize the sums of 
the squares of the distances between the y values of the data and the corresponding y values 
of the line defined by /. This type of fit is called a least squares approximation. If the data is 
represented by the points (x\, y\), {x->. y-i), ■ ■ ■, (x n ■ Dn), then the square of the distance between 
yi and f(xi) is ( f(xi ) — yi) 2 = ( mxi + b — yi) 2 . So our goal is to minimize the sum of these 
squares, of minimize the function S defined by 

n 

S(m, b) = ^(mxi + b- yi) 2 . 

i=l 


(a) Calculate S m and Si,. 

(b) Solve the system S m (m, b) = 0 and Shim, b) = 0 to show that the critical point satisfies 

«(E?,1 y) - (Eli A 
, _ (£2,1 Si) (£2=1 A - (£2,1 Si) (£?.i 

n (£2.i x i) - (£2=i x S 


@®®@ 























174 


10.7. OPTIMIZATION 


(Hint: Don't be daunted by these expressions, the system S m (m , b) = 0 and S&(m, b) = 0 
is a system of two linear equations in the unknowns m and b. It might be easier to let 

r = Ya= 1 x i / s = Y!i=\ x ir t = Yh= i Vi’ and u = Ya= i and write y our equations 
using these constants.) 

(c) Use the Second Derivative Test to explain why the critical point gives a local minimum. 
Can you then explain why the critical point gives an absolute minimum? 

(d) Use the formula from part (b) to find the values of m and b that give the line of best 
fit in the least squares sense to the cricket data. Draw your line on the scatter plot to 
convince yourself that you have a well-fitting line. 
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10.8 Constrained Optimization:Lagrange Multipliers 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the follozving important questions: 


• What geometric condition enables us to optimize a function / = f(x,y) subject to a con¬ 
straint given by g(x, y ) = k, where k is a constant? 

• How can we exploit this geometric condition to find the extreme values of a function subject 
to a constraint? 


Introduction 

We previously considered how to find the extreme values of functions on both unrestricted do¬ 
mains and on closed, bounded domains. Other types of optimization problems involve maximiz¬ 
ing or minimizing a quantity subject to an external constraint. In these cases the extreme values 
frequently won't occur at the points where the gradient is zero, but rather at other points that 
satisfy an important geometric condition. These problems are often called constrained optimization 
problems and can be solved with the method of Lagrange Multipliers, which we study in this 
section. 

Preview Activity 10.8. According to U.S. postal regulations, the girth plus the length of a parcel 
sent by mail may not exceed 108 inches, where by "girth" we mean the perimeter of the smallest 
end. Our goal is to find the largest possible volume of a rectangular parcel with a square end that 
can be sent by mail. 10 If we let x be the length of the side of one square end of the package and y 
the length of the package, then we want to maximize the volume f(x, y) = x 2 y of the box subject 
to the constraint that the girth (4.x) plus the length (y) is as large as possible, or 4 x + y = 108. The 
equation 4 x + y = 108 is thus an external constraint on the variables. 

(a) The constraint equation involves the function g that is given by 

g(x, y) = 4x + y. 

Explain why the constraint is a contour of g, and is therefore a two-dimensional curve. 

(b) Ligure 10.57 shows the graph of the constraint equation g(x, y) = 108 along with a few 
contours of the volume function /. Since our goal is to find the maximum value of / 
subject to the constraint g(x, y) = 108, we want to find the point on our constraint curve 
that intersects the contours of / at which / has its largest value. 

10 We solved this applied optimization problem in single variable Active Calculus, so it may look familiar. We take 
a different approach in this section, and this approach allows us to view most applied optimization problems from 
single variable calculus as constrained optimization problems, as well as provide us tools to solve a greater variety of 
optimization problems. 
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Figure 10.57: Contours of / and the constraint equation g(x, y ) = 108. 


i. Points A and B in Figure 10.57 lie on a contour of / and on the constraint equation 
g(x, y ) = 108. Explain why neither A nor B provides a maximum value of / that 
satisfies the constraint. 

ii. Points C and D in Figure 10.57 lie on a contour of / and on the constraint equation 
g(x, y) = 108. Explain why neither C nor D provides a maximum value of / that 
satisfies the constraint. 

iii. Based on your responses to parts i. and ii., draw the contour of / on which you believe 
/ will achieve a maximum value subject to the constraint g(x,y) = 108. Explain why 
you drew the contour you did. 

(c) Recall that g(x, y) = 108 is a contour of the function g, and that the gradient of a function 
is always orthogonal to its contours. With this in mind, how should V/ and Vg be related 
at the optimal point? Explain. 


CXI 


Constrained Optimization and Lagrange Multipliers 

In Preview Activity 10.8, we considered an optimization problem where there is an external con¬ 
straint on the variables, namely that the girth plus the length of the package cannot exceed 108 
inches. We saw that we can create a function g from the constraint, specifically g(x, y) = 4x + y. 
The constraint equation is then just a contour of g, g(x. y) = c, where c is a constant (in our case 
108). Figure 10.58 illustrates that the volume function / is maximized, subject to the constraint 
g(x, y) = c, when the graph of g(x. y) = c is tangent to a contour of /. Moreover, the value of / on 
this contour is the sought maximum value. To find this point where the graph of the constraint is 
tangent to a contour of /, recall that V/ is perpendicular to the contours of / and Vg is perpen¬ 
dicular to the contour of g. At such a point, the vectors V g and V/ are parallel, and thus we need 
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Ligure 10.58: Contours of / and the constraint contour. 


to determine the points where this occurs. Recall that two vectors are parallel if one is a nonzero 
scalar multiple of the other, so we therefore look for values of a parameter A that make 

V/ = A Vg. (10.16) 

The constant A is called a Lagrange multiplier. 

To find the values of A that satisfy (10.16) for the volume function in Preview Activity 10.8, we 
calculate both V/ and Vg. Observe that 

V/ = 2xgi + x 2 j and Vg = 4i+j, 
and thus we need a value of A so that 


2xyi + x 2 j = A(4i + j). 


Equating components in the most recent equation and incorporating the original constraint, we 
have three equations 

2 xy = A(4) (10.17) 

x 2 = A(l) (10.18) 

4 x + y = 108 (10.19) 


in the three unknowns x, y, and A. First, note that if A = 0, then equation (10.18) shows that 
x = 0. From this. Equation (10.19) tells us that y = 108. So the point (0,108) is a point we need 
to consider. Next, provided that A 7 ^ 0 (from which it follows that x / 0 by Equation (10.18)), we 
may divide both sides of Equation (10.17) by the corresponding sides of (10.18) to eliminate A, and 
thus find that 

2 y a 

— = 4, so 

x 

y = 2 x. 
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Substituting into Equation (10.19) gives us 

4x + 2x = 108 


or 

x = 18. 

Thus we have y = 2x = 36 and A = x 2 = 324 as another point to consider. So the points at which 
the gradients of / and g are parallel, and thus at which / may have a maximum or minimum 
subject to the constraint, are (0,108) and (18, 36). By evaluating the function / at these points, we 
see that we maximize the volume when the length of the square end of the box is 18 inches and the 
length is 36 inches, for a maximum volume of /(18, 36) = 11664 cubic inches. Since /(0,108) = 0, 
we obtain a minimum value at this point. 

We summarize the process of Lagrange multipliers as follows. 

The general technique for optimizing a function / = f(x, y ) subject to a constraint g(x, y) = c 
is to solve the system V/ = Wg for x, y, and A. We then evaluate the function / at each point 
(x, y) that results from a solution to the system in order to find the optimum values of / subject 
to the constraint. 


Activity 10.25. 

A cylindrical soda can holds about 355 cc of liquid. In this activity, we want to find the dimen¬ 
sions of such a can that will minimize the surface area. 

(a) What are the variables in this problem? What restriction(s), if any, are there on these 
variables? 

(b) What quantity do we want to optimize in this problem? What equation describes the 
constraint? 

(c) Find A and the values of your variables that satisfy Equation (10.16) in the context of 
this problem. 

(d) Determine the dimensions of the pop can that give the desired solution to this con¬ 
strained optimization problem. 


<1 

The method of Lagrange multipliers also works for functions of more than two variables. 

Activity 10.26. 

Use the method of Lagrange multipliers to find the dimensions of the least expensive packing 
crate with a volume of 240 cubic feet when the material for the top costs $2 per square foot, the 
bottom is $3 per square foot and the sides are $1.50 per square foot. 

<1 
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Summary 

• The extrema of a function / = f(x,y ) subject to a constraint g(x,y ) = c occur at points for 
which the contour of / is tangent to the curve that represents the constraint equation. This 
occurs when 

V/ = A Vg. 

• We use the condition Vf = XVg to generate a system of equations, together with the constraint 
g(x, y) = c, that may be solved for x, y, and A. Once we have all the solutions, we evaluate / at 
each of the (x, y) points to determine the extrema. 


Exercises 

1. The Cobb-Douglas production function is used in economics to model production levels based 
on labor and equipment. Suppose we have a specific Cobb-Douglas function of the form 

f(x,y) = 50x°' 4 ?/ 0 ’ 6 , 

where x is the dollar amount spent on labor and y the dollar amount spent on equipment. Use 
the method of Lagrange multipliers to determine how much should be spent on labor and how 
much on equipment to maximize productivity if we have a total of $1.5 million dollars to invest 
in labor and equipment. 

2 . Use the method of Lagrange multipliers to find the point on the line x — 2y = 5 that is closest 
to the point (1,3). To do so, respond to the following prompts. 

(a) Write the function / = f(x,y ) that measures the square of the distance from (x, y) to 
(1,3). (The extrema of this function are the same as the extrema of the distance function, 
but f(x, y) is simpler to work with.) 

(b) What is the constraint g(x,y) = c? 

(c) Write the equations resulting from V f = XVg and the constraint. Lind all the points 
(x, y) satisfying these equations. 

(d) Test all the points you found to determine the extrema. 

3. Apply the Method of Lagrange Multipliers solve each of the following constrained optimiza¬ 
tion problems. 

(a) Determine the absolute maximum and absolute minimum values of f(x, y) = (x — l ) 2 + 
(y — 2) 2 subject to the constraint that x 2 + y 2 = 16. 

(b) Determine the points on the sphere x 2 + y 2 + z 2 = 4 that are closest to and farthest from 
the point (3,1, —1). (As in the preceding exercise, you may find it simpler to work with 
the square of the distance formula, rather than the distance formula itself.) 
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(c) Find the absolute maximum and minimum of f(x, y, z ) = x 2 + y 2 + z 2 subject to the 
constraint that (x — 3) 2 + (y + 2 ) 2 + (z — 5 ) 2 < 16. (Hint: here the constraint is a closed, 
bounded region. Use the boundary of that region for applying Lagrange Multipliers, 
but don't forget to also test any critical values of the function that lie in the interior of 
the region.) 

4. There is a useful interpretation of the Lagrange multiplier A. Assume that we want to optimize 
a function / with constraint g(x, y) = c. Recall that an optimal solution occurs at a point (xo, yo) 
where V/ = A Vg. As the constraint changes, so does the point at which the optimal solution 
occurs. So we can think of the optimal point as a function of the parameter c, that is xq = xq(c) 
and yo = yo(c). The optimal value of / subject to the constraint can then be considered as a 
function of c defined by f(xo(c),yo(c)). The Chain Rule shows that 

df = df_dxo | df dyp 
dc dxo dc dyo dc 


(a) Use the fact that V/ = A Vg at (xo, yo) to explain why 

df_ = .dg 
dc dc 

(b) Use the fact that g(x, y) = c to show that 



Conclude that A tells us the rate of change of the function / as the parameter c increases 
(or by approximately how much the optimal value of the function / will change if we 
increase the value of c by 1 unit). 

(c) Suppose that A = 324 at the point where the package described in Preview Activity 10.8 
has its maximum volume. Explain in context what the value 324 tells us about the 
package. 
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Chapter 11 

Multiple Integrals 

11.1 Double Riemann Sums and Double Integrals over Rectangles 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 

• What is a double Riemann sum? 

• How is the double integral of a continuous function / = f{x, y) defined? 

• What are two things the double integral of a function can tell us? 


Introduction 

In single-variable calculus, recall that we approximated the area under the graph of a positive 
function / on an interval [a, b] by adding areas of rectangles whose heights are determined by 
the curve. The general process involved subdividing the interval [a, b] into smaller subintervals, 
constructing rectangles on each of these smaller intervals to approximate the region under the 
curve on that subinterval, then summing the areas of these rectangles to approximate the area 
under the curve. We will extend this process in this section to its three-dimensional analogs, 
double Riemann sums and double integrals over rectangles. 

Preview Activity 11.1. In this activity we introduce the concept of a double Riemann sum. 

(a) Review the concept of the Riemann sum from single-variable calculus. Then, explain how 
we define the definite integral [ b f(x) dx of a continuous function of a single variable x 
on an interval [a, b\. Include a sketch of a continuous function on an interval [a, b] with 
appropriate labeling in order to illustrate your definition. 

(b) In our upcoming study of integral calculus for multivariable functions, we will first extend 
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the idea of the single-variable definite integral to functions of two variables over rectan¬ 
gular domains. To do so, we will need to understand how to partition a rectangle into 
subrectangles. Let R be rectangular domain R = {(x,y) : 0 < x < 6, 2 < y < 4} (we can 
also represent this domain with the notation [0,6] x [2,4]), as pictured in Figure 11.1. 

4- 


2 - 

0 6 

Figure 11.1: Rectangular domain R with subrectangles. 

To form a partition of the full rectangular region, R, we will partition both intervals [0, 6 ] 
and [2,4]; in particular, we choose to partition the interval [0, 6 ] into three uniformly sized 
subintervals and the interval [2,4] into two evenly sized subintervals as shown in Figure 
11.1. In the following questions, we discuss how to identify the endpoints of each subin¬ 
terval and the resulting subrectangles. 

i. Let 0 = xq < x\ < X 2 < 23 = 6 be the endpoints of the subintervals of [0, 6 ] after 
partitioning. What is the length Ax of each subinterval [x,_ 1 , x,;] for i from 1 to 3? 

ii. Explicitly identify xq, x \ , xy, and X 3 . On Figure 11.1 or your own version of the 
diagram, label these endpoints. 

iii. Let 2 = yo < £/i < 2/2 = 4 be the endpoints of the subintervals of [2, 4] after partition¬ 
ing. What is the length Ay of each subinterval yj\ for j from 1 to 2? Identify yo, 
yi, and yo and label these endpoints on Figure 11.1. 

iv. Let Rij denote the subrectangle [x,_ 1 , x,] x [y ? _ 1 , y/|. Appropriately label each sub¬ 
rectangle in your drawing of Figure 11.1. How does the total number of subrectangles 
depend on the partitions of the intervals [0, 6 ] and [2,4]? 

v. What is area A A of each subrectangle? 


CXI 


Double Riemann Sums over Rectangles 

For the definite integral in single-variable calculus, we considered a continuous function over a 
closed, bounded interval [a, 6]. In multivariable calculus, we will eventually develop the idea of a 
definite integral over a closed, bounded region (such as the interior of a circle). We begin with a 
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simpler situation by thinking only about rectangular domains, and will address more complicated 
domains in the following section. 

Let / = f(x, y) be a continuous function defined on a rectangular domain R = {(x. y) : a < 
x < b. c < y < d\. As we saw in Preview Activity 11.1, the domain is a rectangle R and we want to 
partition R into subrectangles. We do this by partitioning each of the intervals [a, b] and [c, d] into 
subintervals and using those subintervals to create a partition of R into subrectangles. In the first 
activity, we address the quantities and notations we will use in order to define double Riemann 
sums and double integrals. 

Activity 11.1. 

Let f(x,y ) = 100 — x 2 — y 2 be defined on the rectangular domain R = [ a,b} x [c, d]. Partition 
the interval [a, b] into four uniformly sized subintervals and the interval [c, d] into three evenly 
sized subintervals as shown in Figure 11.2. As we did in Preview Activity 11.1, we will need a 
method for identifying the endpoints of each subinterval and the resulting subrectangles. 



Figure 11.2: Rectangular domain with subrectangles. 


(a) Let a = xo < xi < X 2 < X 3 < X 4 = bbe the endpoints of the subintervals of [a, b] after 
partitioning. Label these endpoints in Figure 11.2. 

(b) What is the length Ax of each subinterval [x,_i, x, ? Your answer should be in terms of 
a and b. 

(c) Let c = yo < yi < V 2 < V 3 = d be the endpoints of the subintervals of [c, d] after 
partitioning. Label these endpoints in Figure 11.2. 

(d) What is the length Ay of each subinterval [y ? _ i. y ; ]? Your answer should be in terms of 
c and d. 

(e) The partitions of the intervals [a, b] and [c, d] partition the rectangle R into subrectangles. 
How many subrectangles are there? 

(f) Let Rij denote the subrectangle [x,_ i, ay] x [y ; _ 1 . y ? ]. Label each subrectangle in Figure 
11 . 2 . 

(g) What is area A A of each subrectangle? 
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(h) Now let [a, 6 ] = [0,8] and [c,d\ = [2,6]. Let (x^,^) be the point in the upper right 
corner of the subrectangle Rn. Identify and correctly label this point in Figure 11.2. 
Calculate the product 

f(x* u ,y* u )AA. 

Explain, geometrically, what this product represents. 

(i) For each i and j, choose a point (x*pUij) i n the subrectangle Identify and correctly 
label these points in Figure 11.2. Explain what the product 

f(x*j,y*j)AA 

represents. 

(j) If we were to add all the values f(x*j. y*-)AA for each i and j, what does the resulting 
number approximate about the surface defined by / on the domain R? (You don't 
actually need to add these values.) 

(k) Write a double sum using summation notation that expresses the arbitrary sum from 
part (j). 


<1 


Double Riemann Sums and Double Integrals 

Now we use the process from the most recent activity to formally define double Riemann sums 
and double integrals. 


Definition 11.1. Let / be a continuous function on a rectangle R = {(x , y) : a < x < b, c < y < 
d}. A double Riemann sum for / over R is created as follows. 

• Partition the interval [a, b] into m subintervals of equal length Ax = Let xq, x \ , ..., 
x m be the endpoints of these subintervals, where a = xo < x\ < X 2 < ■ ■ ■ < x m = b. 

• Partition the interval [c, d\ into n subintervals of equal length Ay = Let //q, y\, ..., 
y n be the endpoints of these subintervals, where c = ya < y\ < ij 2 < ■ ■ ■ < y n = d. 

• These two partitions create a partition of the rectangle R into ran subrectangles Rij with 
opposite vertices r/j-i) and ( Xi,yj ) for i between 1 and m and j between 1 and n. 
These rectangles all have equal area A A = Ax • Ay. 

• Choose a point (x*^. y*-) in each rectangle Rij. Then, a double Riemann sum for / over R 
is given by 

n m 

EE/H>^)- aa 

j= 1 i =1 
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If f(x,y) > 0 on the rectangle R, we may ask to find the volume of the solid bounded above 
by / over R, as illustrated on the left of Figure 11.3. This volume is approximated by a Riemann 
sum, which sums the volumes of the rectangular boxes shown on the right of Figure 11.3. 



Figure 11.3: The volume under a graph approximated by a Riemann Sum 


As we let the number of subrectangles increase without bound (in other words, as both m and 
n in a double Riemann sum go to infinity), as illustrated in Figure 11.4, the sum of the volumes of 
the rectangular boxes approaches the volume of the solid bounded above by / over R. The value 
of this limit, provided it exists, is the double integral. 



Figure 11.4: Finding better approximations by using smaller subrectangles. 
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Definition 11.2. Let R be a rectangular region in the x-y plane and / a continuous function 
over R. With terms defined as in a double Riemann sum, the double integral of / over R is 

n m 

j =i *=i 


// f{x,y)dA= lim 

J J R m,n—>o 


Interpretation of Double Riemann Sums and Double integrals. 

At the moment, there are two ways we can interpret the value of the double integral. 

• Suppose that f(x,y) assumes both positive and negatives values on the rectangle R, as 
shown on the left of Figure 11.5. When constructing a Riemann sum, for each i and j, the 
product ■ A A can be interpreted as a "signed" volume of a box with base area A A 

and "signed" height /(x*-,y*-). Since / can have negative values, this "height" could be 
negative. The sum 

n m 

1=1 *=l 

can then be interpreted as a sum of "signed" volumes of boxes, with a negative sign attached 
to those boxes whose heights are below the xy-plane. We can then realize the double integral 




Figure 11.5: The integral measures signed volume. 

ff R f(x, y) cLA as a difference in volumes: f f R f(x, y) dA tells us the volume of the solids the 
graph of / bounds above the .xy-plane over the rectangle R minus the volume of the solids 
the graph of / bounds below the xy-plane under the rectangle II. This is shown on the right 
of Figure 11.5. 
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• The average of the finitely many mn values /(x*-, y*j) that we take in a double Riemann sum 
is given by 

- n m 

Av s™ =— 

j =i *=i 

If we take the limit as m and n go to infinity, we obtain what we define as the average value 
of / over the region R, which is connected to the value of the double integral. First, to view 
Avg mn as a double Riemann sum, note that 


Ax 


b — a 
m 


and 


Ay 


d — c 
n 


Thus, 


1 Ax •Ay A A 

mn (b — a)(d — c ) A(R) ’ 

where A(R) denotes the area of the rectangle R. Then, the average value of the function / 
over R, / Av G (r), is given by 


f avg(r) = lim - f( x ij > 

m,n—>oo TflTl LL ' J 


ij i V 


ij/ 


j= 1 i=l 

n m 


= ds«, A(R) E E /<4. y>i) ■ A ^ 

3=1 »=1 

1 ' ' f(x,y)dA. 

IR 


A{R) 


Therefore, the double integral of / over /i divided by the area of R gives us the average 
value of the function / on R. Finally, if /(x. y) > 0 on II, we can interpret this average value 
of / on R as the height of the box with base R that has the same volume as the volume of the 
surface defined by / over R. 


Activity 11.2. 

Let /(x, y) = x + 2y and let R = [0, 2] x [1, 3]. 

(a) Draw a picture of R. Partition [0, 2] into 2 subintervals of equal length and the interval 
[1,3] into two subintervals of equal length. Draw these partitions on your picture of R 
and label the resulting subrectangles using the labeling scheme we established in the 
definition of a double Riemann sum. 

(b) For each i and j, let (x*j , y*-) be the midpoint of the rectangle R,j. Identify the coordi¬ 
nates of each Draw these points on your picture of R. 

(c) Calculate the Riemann sum 

n m 

3=1 i =1 
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using the partitions we have described. If we let (a;*-, $/*■) be the midpoint of the rectan¬ 
gle Rij for each i and j, then the resulting Riemann sum is called a midpoint sum. 

(d) Give two interpretations for the meaning of the sum you just calculated. 


<1 

Activity 11.3. 

Let f(x, y ) = on the rectangular domain R = [1, 7] x [—2, 2]. Partition [1, 7] into 3 equal 

length subintervals and [—2, 2] into 2 equal length subintervals. A table of values of / at some 
points in R is given in Table 11.1, and a graph of / with the indicated partitions is shown in 
Figure 11.6. 



-2 

-1 

0 

1 

2 

1 

0 

V3 

2 

v/3 

0 

2 

0 

V3 

2 

V3 

0 

3 

0 

V3 

2 

V3 

0 

4 

0 

V3 

2 

V3 

0 

5 

0 

V3 

2 

V3 

0 

6 

0 

V3 

2 

V3 

0 

7 

0 

V3 

2 

V3 

0 


Table 11.1: Table of values of 

f(x,y) = \/4-y 2 . 



y 

Figure 11.6: Graph of f(x,y) 
a/4 — y 2 on R. 


(a) Outline the partition of R into subrectangles on the table of values in Table 11.1. 

(b) Calculate the double Riemann sum using the given partition of R and the values of / in 
the upper right corner of each subrectangle. 

(c) Use geometry to calculate the exact value of y) dA and compare it to your ap¬ 

proximation. How could we obtain a better approximation? 


<1 

We conclude this section with a list of properties of double integrals. Since similar properties 
are satisfied by single-variable integrals and the arguments for double integrals are essentially the 
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same, we omit their justification. 

Properties of Double Integrals. Let / and g be continuous functions on a rectangle R = 

{(x, y) : a < x < b, c < y < d}, and let k be a constant. Then 

!■ ff R (f( x , y) + s(®, y)) dA = Hr f( x -< y) dA + ff R a( x , v) dA - 
2 - Hr */(*> y) dA = k ff R f( x , y) dA. 

3. If f(x,y) > g(x, y) on R, then fj R f(x, y) dA > Jf R g(x,y)dA. 


Summary 

• Let / be a continuous function on a rectangle R = {(x,y) : a < x < b. c < y < d). The double 
Riemann sum for / over II is created as follows. 

- Partition the interval [a, b\ into m subintervals of equal length Ax = Let xq, x\, ..., 
x m be the endpoints of these subintervals, where a = xo < x\ < X 2 < ■ ■ ■ < x m = b. 

- Partition the interval [c, d] into n subintervals of equal length Ay = Let yo, y\, ..., y n 
be the endpoints of these subintervals, where c = yo < yi < y 2 < ■ ■ • < y n = d. 

- These two partitions create a partition of the rectangle R into mn subrectangles R,jj with 

opposite vertices and ( Xi,yj ) for i between 1 and m and j between 1 and n. 

These rectangles all have equal area A A = Ax • Ay. 

- Choose a point (x* -, y*j ) in each rectangle R t:j . Then a double Riemann sum for / over R is 
given by 

n m 

AA 

j= 1 i =1 

• With terms defined as in the Double Riemann Sum, the double integral of / over R is 

n m 

j =i *=i 

• Two interpretations of the double integral J j f(x. y) dA are: 

- The volume of the solids the graph of / bounds above the xy-plane over the rectangle 
R minus the volume of the solids the graph of / bounds below the .xy-plane under the 
rectangle R ; 

- Dividing the double integral of / over R by the area of R gives us the average value of 
the function / on R. If fix. y) > 0 on II, we can interpret this average value of / on R as 
the height of the box with base R that has the same volume as the volume of the surface 
defined by / over R. 


R 


f(x,y)dA= lim 

m,n—> oo 
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Exercises 

1. The temperature at any point on a metal plate in the xy plane is given by T{x, y) = 100—4 x 2 —y 2 , 
where x and y are measured in inches and T in degrees Celsius. Consider the portion of the 
plate that lies on the rectangular region R = [1,5] x [3,6]. 

(a) Estimate the value of / f R T(x, y) dA by using a double Riemann sum with two subin¬ 
tervals in each direction and choosing (x *, y*) to be the point that lies in the upper right 
corner of each subrectangle. 

(b) Determine the area of the rectangle R. 

(c) Estimate the average temperature, T AVG (r), over the region R. 

(d) Do you think your estimate in (c) is an over- or under-estimate of the true temperature? 
Why? 

2. The wind chill, as frequently reported, is a measure of how cold it feels outside when the wind 
is blowing. In Table 11.2, the wind chill w = w(v,T), measured in degrees Fahrenheit, is a 
function of the wind speed v, measured in miles per hour, and the ambient air temperature 
T, also measured in degrees Fahrenheit. Approximate the average wind chill on the rectangle 
[5,35] x [—20, 20] using 3 subintervals in the v direction, 4 subintervals in the T direction, and 
the point in the lower left corner in each subrectangle. 


v\T 

-20 

-15 

-10 

-5 

0 

5 

10 

15 

20 

5 

-34 

-28 

-22 

-16 

-11 

-5 

1 

7 

13 

10 

-41 

-35 

-28 

-22 

-16 

-10 

-4 

3 

9 

15 

-45 

-39 

-32 

-26 

-19 

-13 

-7 

0 

6 

20 

-48 

-42 

-35 

-29 

-22 

-15 

-9 

-2 

4 

25 

-51 

-44 

-37 

-31 

-24 

-17 

-11 

-4 

3 

30 

-53 

-46 

-39 

-33 

-26 

-19 

-12 

-5 

1 

35 

-55 

-48 

-41 

-34 

-27 

-21 

-14 

-7 

0 


Table 11.2: Wind chill as a function of wind speed and temperature. 


3. Consider the box with a sloped top that is given by the following description: the base is the 
rectangle R = [0,4] x [0, 3], while the top is given by the plane z = p(x, y) = 20 — 2x — 3 y. 

(a) Estimate the value of ff R p(x, y) dA by using a double Riemann sum with four subin¬ 
tervals in the x direction and three subintervals in the y direction, and choosing (x *, y *) 
to be the point that is the midpoint of each subrectangle. 

(b) What important quantity does your double Riemann sum in (a) estimate? 
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(c) Suppose it can be determined that jf R p(x, y) dA = 138. What is the exact average value 
of p over R ? 

(d) If you wanted to build a rectangular box (with the same base) that has the same volume 
as the box with the sloped top described here, how tall would the rectangular box have 
to be? 
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11.2 Iterated Integrals 


Motivating Questions 


In this section, we strive to understand the ideas generated by the following important questions: 


• How do we evaluate a double integral over a rectangle as an iterated integral, and why 
does this process work? 


Introduction 

Recall that we defined the double integral of a continuous function / = f(x, y) over a rectangle 

R = [a, b\ x [c, d] as 

n m 

12 12 f (4> y*j) • 

3 =1 i =1 

where the different variables and notation are as described in Section 11.1. Thus f f R f(x,y ) dA is 
a limit of double Riemann sums, but while this definition tells us exactly what a double integral 
is, it is not very helpful for determining the value of a double integral. Fortunately, there is a way 
to view a double integral as an iterated integral, which will make computations feasible in many 
cases. 

The viewpoint of an iterated integral is closely connected to an important idea from single¬ 
variable calculus. When we studied solids of revolution, such as the one shown in Figure 11.7, we 
saw that in some circumstances we could slice the solid perpendicular to an axis and have each 
slice be approximately a circular disk. From there, we were able to find the volume of each disk, 
and then use an integral to add the volumes of the slices. In what follows, we are able to use single 
integrals to generalize this approach to handle even more general geometric shapes. 


f(x, y) dA = lim 


R 


m.n—>oo 



Figure 11.7: A solid of revolution. 
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Preview Activity 11.2. Let f(x, y) = 25 — x 2 — y 2 on the rectangular domain R = [—3, 3] x [—4,4]. 

As with partial derivatives, we may treat one of the variables in / as constant and think of 
the resulting function as a function of a single variable. Now we investigate what happens if we 
integrate instead of differentiate. 

(a) Choose a fixed value of x in the interior of [—3,3]. Let 

A{x) = J f( x , y) d V- 

What is the geometric meaning of the value of A(x) relative to the surface defined by /. 
(Hint: Think about the trace determined by the fixed value of x, and consider how A(x) is 
related to Figure 11.8.) 



Figure 11.8: A cross section with fixed 
x. 



Figure 11.9: A cross section with fixed 
x and Ax. 


(b) For a fixed value of x, say x*, what is the geometric meaning of A(x *) Ax? (Hint: Consider 
how A(x*)Ax is related to Figure 11.9.) 

(c) Since / is continuous on R, we can define the function A = A[x) at every value of x 
in [—3,3]. Now think about subdividing the x-interval [—3,3] into rn subintervals, and 
choosing a value x* in each of those subintervals. What will be the meaning of the sum 

m 

J2 a (x*)Ax? 

i= 1 

(d) Explain why A(x) dx will determine the exact value of the volume under the surface 
z = f(x,y) over the rectangle R. 


CXI 
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Iterated Integrals 


The ideas that we explored in Preview Activity 11.2 work more generally and lead to the idea of 
an iterated integral. Let / be a continuous function on a rectangular domain R = [a, b]x [c, d], and 
let 


A(x) = / f(x, y) dy. 


The function A = A(x ) determines the value of the cross sectional area 1 in the y direction for the 
fixed value of x of the solid bounded between the surface defined by / and the ary-plane. 





Figure 11.10: Summing cross section slices. 


The value of this cross sectional area is determined by the input x in A. Since A is a function of 
x, it follows that we can integrate A with respect to x. In doing so, we use a partition of [a, b] and 
make an approximation to the integral given by 


A(x)dx 


^ ^A(x*)Ax , 

i= 1 


where x* is any number in the subinterval [x,;_], x,]. Each term A(x*)Ax in the sum represents an 
approximation of a fixed cross sectional slice of the surface in the y direction with a fixed width of 
Ax as illustrated in Figure 11.9. We add the signed volumes of these slices as shown in the frames 
in Figure 11.10 to obtain an approximation of the total signed volume. 

As we let the number of subintervals in the x direction approach infinity, we can see that 

m 

the Riemann sum A{x*)Ax approaches a limit and that limit is the sum of signed volumes 

i= 1 

bounded by the function / on R. Therefore, since A(x) is itself determined by an integral, we have 



f( x , y) dA 


m 

lim Y A(x*)Ax 

m—>• oo ^ J 
i =1 


A(x)dx 



f(x,y)dy 


dx. 


: By area we mean "signed" area. 
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Hence, we can compute the double integral of / over R by first integrating / with respect to y 
on [c, d], then integrating the resulting function of x with respect to x on [a, b]. The nested integral 

f*b / pd \ pb pd 

f(x,y)dy) dx = / / f(x,y) dy dx 


> a \J c 


is called an iterated integral, and we see that each double integral may be represented by two single 
integrals. 

We made a choice to integrate first with respect to y. The same argument shows that we can 
also find the double integral as an iterated integral integrating with respect to x first, or 

pd / pb \ pd pb 

f(x,y)dA= I / f(x,y)dx)dy= / f(x,y)dxdy. 

R J c \J a J J c J a 

The fact that integrating in either order results in the same value is known as Fubini's Theorem. 


Fubini's Theorem. If / = f(x, y) is a continuous function on a rectangle R = [a, b] x [c, d], then 

pd pb pb pd 

f(x,y)dA= / / f(x,y) dx dy = / / f(x,y)dydx. 


R 


Fubini's theorem enables us to evaluate iterated integrals without resorting to the limit def¬ 
inition. Instead, working with one integral at a time, we can use the Fundamental Theorem of 
Calculus from single-variable calculus to find the exact value of each integral, starting with the 
inner integral. 

Activity 11.4. 

Let f(x, y) = 25 — x 2 — y 2 on the rectangular domain R = [—3, 3] x [—4,4], 

(a) Viewing r as a fixed constant, use the Fundamental Theorem of Calculus to evaluate 
the integral 

A{x) = J f(x,y)dy. 

Note that you will be integrating with respect to y, and holding x constant. Your result 
should be a function of x only. 

(b) Next, use your result from (a) along with the Fundamental Theorem of Calculus to 

f 3 

determine the value of J A(x) dx. 


(c) 


What is the value of J J f(x, y) dA ? What are two different ways we may interpret the 
meaning of this value? 


<1 


Activity 11.5. 
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Let f(x, y) = x + y 2 on the rectangle R = [0, 2] x [0, 3]. 


(a) 


Evaluate j J f(x,y) dA using an iterated integral. Choose an order for integration by 
deciding whether you want to integrate first with respect to x or y. 

(b) Evaluate Jj f(x, y) dA using the iterated integral whose order of integration is the 


opposite of the order you chose in (a). 


<1 


Summary 


• We can evaluate the double integral 
iterated integral in one of two ways: 



f(x,y) dA over a rectangle R 


- fX[ i{x ' v)ix ) 


dx, or 

dy. 


[a, b] x [c, d] as an 


This process works because each inner integral represents a cross-sectional (signed) area and 
the outer integral then sums all of the cross-sectional (signed) areas. Fubini's Theorem guaran¬ 
tees that the resulting value is the same, regardless of the order in which we integrate. 


Exercises 

1. Evaluate each of the following double or iterated integrals exactly. 

(a) 

(b) 


7r/4 


xy dy ) dx 
rn/3 


sin(x) cos(y) dx dy 


(c) (_ ^e~ 2x ~ 3y dy ) dx 

a/2x + 5 y dA, where R = [0, 2] x [0, 3]. 


(d) 


2. The temperature at any point on a metal plate in the xy plane is given by T(x, y) = 100—4.x 2 — y 2 , 
where x and y are measured in inches and T in degrees Celsius. Consider the portion of the 
plate that lies on the rectangular region R = [1, 5] x [3, 6]. 

(a) Write an iterated integral whose value represents the volume under the surface T over 
the rectangle R. 
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(b) Evaluate the iterated integral you determined in (a). 

(c) Find the area of the rectangle, R. 

(d) Determine the exact average temperature, T AVG(R) , over the region R. 

3. Consider the box with a sloped top that is given by the following description: the base is the 
rectangle R = [1,4] x [2, 5], while the top is given by the plane z = p(x, y) = 30 — x — 2 y. 

(a) Write an iterated integral whose value represents the volume under p over the rectangle 
R, 

(b) Evaluate the iterated integral you determined in (a). 

(c) What is the exact average value of p over R? 

(d) If you wanted to build a rectangular box (with an identical base) that has the same 
volume as the box with the sloped top described here, how tall would the rectangular 
box have to be? 
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11.3 Double Integrals over General Regions 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 

• How do we define a double integral over a non-rectangular region? 

• What general form does an iterated integral over a non-rectangular region have? 


Introduction 


Recall that we defined the double integral of a continuous function / = f(x, y) over a rectangle 

R = [a, b] x [c, d] as 

~ ~ n m 

f(x,y)dA= lim VV/ (x* ) , yf) ■ AA, 

m..n.—±no * J J 


R 


j =1 i= 1 


where the notation is as described in Section 11.1. Furthermore, we have seen that we can evaluate 
a double integral JJ f(x,y ) dA over R as an iterated integral of either of the forms 


b pd rd rb 

f(x,y)dydx or / / f(x,y)dxdy. 


It is natural to wonder how we might define and evaluate a double integral over a non- 
rectangular region; we explore one such example in the following preview activity. 

Preview Activity 11.3. A tetrahedron is a three-dimensional figure with four faces, each of which 
is a triangle. A picture of the tetrahedron T with vertices (0, 0, 0), (1,0,0), (0,1,0), and (0, 0,1) is 
shown in Figure 11.11. If we place one vertex at the origin and let vectors a, b, and c be determined 
by the edges of the tetrahedron that have one end at the origin, then a formula that tells us the 
volume V of the tetrahedron is 

V= J|a-(bxc)|. (11.1) 

b 


(a) Use the formula (11.1) to find the volume of the tetrahedron T. 

(b) Instead of memorizing or looking up the formula for the volume of a tetrahedron, we can 
use a double integral to calculate the volume of the tetrahedron T. To see how, notice that 
the top face of the tetrahedron T is the plane whose equation is 

z = l — (x + y). 
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Figure 11.11: The tetrahedron T. 


Figure 11.12: Projecting T onto the 
xy-plane. 


Provided that we can use an iterated integral on a non-rectangular region, the volume of 
the tetrahedron will be given by an iterated integral of the form 



1 — (x + y) dy dx. 


The issue that is new here is how we find the limits on the integrals; note that the outer 
integral's limits are in x, while the inner ones are in y, since we have chosen dA = dy dx. 
To see the domain over which we need to integrate, think of standing way above the tetra¬ 
hedron looking straight down on it, which means we are projecting the entire tetrahedron 
onto the xy-plane. The resulting domain is the triangular region shown in Figure 11.12. 


Explain why we can represent the triangular region with the inequalities 

0 < y < 1 — x and 0 < x < 1. 

(Hint: Consider the cross sectional slice shown in Figure 11.12.) 

(c) Explain why it makes sense to now write the volume integral in the form 


x=? ry =? 


px= i r 

J x=? J v 


1 — (x + y) dy dx = 


f*x=l ry=1—x 


1 — (x + y) dy dx. 


/ x=! J y=( J x=0 J y =0 

(d) Use the Fundamental Theorem of Calculus to evaluate the iterated integral 


x=l ry=1—x 


rx=l (■ 

J £=0 J y =0 


and compare to your result from part (a), 
regions, start with the inner integral.) 


1 — (x + y) dy dx 

(As with iterated integrals over rectangular 
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Double Integrals over General Regions 

So far, we have learned that a double integral over a rectangular region may be interpreted in one 
of two ways: 


II f(x, y ) dA tells us the volume of the solids the graph of / bounds above the xy-plane 

over the rectangle R minus the volume of the solids the graph of / bounds below the xy- 
plane under the rectangle R ; 


• / / f(x, y) dA, where A(R) is the area of R tells us the average value of the function / 

A\R) J Jr 

on R. If f(x, y) > 0 on R, we can interpret this average value of / on R as the height of the 
box with base R that has the same volume as the volume of the surface defined by / over II. 


As we saw in Preview Activity 11.1, a function / = f(x,y) may be considered over regions 
other than rectangular ones, and thus we want to understand how to set up and evaluate double 
integrals over non-rectangular regions. Note that if we can, then the two interpretations of the 
double integral noted above will naturally extend to solid regions with non-rectangular bases. 

So, suppose / is a continuous function on a closed, bounded domain D. For example, consider 
D as the circular domain shown in Figure 11.13. We can enclose Pina rectangular domain R as 




Figure 11.13: A non-rectangular do- Figure 11.14: Enclosing this domain 

main. in a rectangle. 


shown in Figure 11.14 and extend the function / to be defined over R in order to be able to use 
the definition of the double integral over a rectangle. We extend / in such a way that its values at 
the points in R that are not in D contribute 0 to the value of the integral. In other words, define a 
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function F = F(x, y) on R as 


F{x,y) 


f(x,y), if (x,y) G D. 
0, if {x,y)£D 


We then say that the double integral of / over D is the same as the double integral of F over R, 
and thus 


f(x,y)dA 


F(x, y) dA. 


JJd JJr 

In practice, we just ignore everything that is in R but not in D, since these regions contribute 0 to 
the value of the integral. 


Just as with double integrals over rectangles, a double integral over a domain D can be evalu¬ 
ated as an iterated integral. If the region D can be described by the inequalities gi(x) < y < g-A/x) 
and a < x < b, where g\ = g\ (x) and ry 2 = <y 2 (x) are functions of only x, then 



f(x,y) dA 



ry=92(x) 

'y=gi{x) 


f(x,y)dydx. 


Alternatively, if the region D is described by the inequalities h\(y) < x < h->(y) and c < y < d, 
where h\ = h\ (y) and /) 2 = // 2 (?y) are functions of only y, we have 



f(x,y) dA 


ry=d px=h 2 (y) 
ly=c Jx=h 1 (y) 


f(x, y) dx dy. 


The structure of an iterated integral is of particular note: 

In an iterated double integral: 

• the limits on the outer integral must be constants; 

• the limits on the inner integral must be constants or in terms of only the remaining vari¬ 
able - that is, if the inner integral is with respect to y, then its limits may only involve x 
and constants, and vice versa. 


We next consider a detailed example. 

Example 11.1. Let f(x, y) = x 2 y be defined on the triangle D with vertices (0, 0), (2,0), and (2,3) 
as shown in Figure 11.15. To evaluate jj f(x, y) dA, we must first describe the region D in terms 
of the variables x and y. We take two approaches. 


Approach 1: Integrate first with respect to y. 

gral as an iterated integral in the form 


In this case we choose to evaluate the double inte- 



x=b fy=g 2 {x) 


=0 r 
=a J y 


x 2 y dy dx , 


x=a Jy=g\(x) 
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Figure 11.15: A triangular 
domain. 



Figure 11.16: Slices in the 
y direction. 



l 2 3 


Figure 11.17: Slices in the 
x direction. 


and therefore we need to describe D in terms of inequalities 


g\ (x) <y< g 2 (x) and a < x < b. 


Since we are integrating with respect to y first, the iterated integral has the form 



rx=b 


A(x)dx, 


where A(x) is a cross sectional area in the y direction. So we are slicing the domain perpen¬ 
dicular to the rc-axis and want to understand what a cross sectional area of the overall solid 
will look like. Several slices of the domain are shown in Figure 11.16. On a slice with fixed x 
value, the y values are bounded below by 0 and above by the y coordinate on the hypotenuse 
of the right triangle. Thus, gi(x) = 0; to find y = we need to write the hypotenuse as 

a function of x. The hypotenuse connects the points (0,0) and (2,3) and hence has equation 
y = \x. This gives the upper bound on y as §2 [x) = \x. The leftmost vertical cross section is 
at x = 0 and the rightmost one is at x = 2, so we have a = 0 and 6 = 2. Therefore, 



rx=2 


ry= l x 


x 2 y dy dx. 


Ix=0 Jy =0 


We evaluate the iterated integral by applying the Fundamental Theorem of Calculus first to 
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the inner integral, and then to the outer one, and find that 



Approach 2: Integrate first with respect to x. In this case, we choose to evaluate the double inte¬ 
gral as an iterated integral in the form 



ry=d rx=h 2 (y) 

ly=c J x=h\ (y) 


x 2 y dx dy 


and thus need to describe D in terms of inequalities 


h\(y) < x < h 2 (y) and c <y < d. 


Since we are integrating with respect to x first, the iterated integral has the form 



Mv) d Vi 


where A(y) is a cross sectional area of the solid in the x direction. Several slices of the domain 
- perpendicular to the y-axis - are shown in Figure 11.17. 

On a slice with fixed y value, the x values are bounded below by the x coordinate on the 
hypotenuse of the right triangle and above by 2. So h 2 (y) = 2; to find h \ (y), we need to write 
the hypotenuse as a function of y. Solving the earlier equation we have for the hypotenuse 
(y = |x) for x gives us x = | y. This makes h \ (y) = | y. The lowest horizontal cross section 
is at y = 0 and the uppermost one is at y = 3, so we have c = 0 and d = 3. Therefore, 

rr 9 fy = 3 r x = 2 9 

// xydA= / / xydxdy. 

JjD J y=0 Jx=(2/3)y 


We evaluate the resulting iterated integral as before by twice applying the Fundamental 
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Theorem of Calculus, and find that 

rv =3 r 2 


x y dx dy 


>y= o ->x=\y 





x=2 

y dx 

x =h 


81 1 


dy 



We see, of course, that in the situation where D can be described in two different ways, the order 
in which we choose to set up and evaluate the double integral doesn't matter, and the same value 
results in either case. 


The meaning of a double integral over a non-rectangular region, D , parallels the meaning over 
a rectangular region. In particular. 


f(x,y ) dA tells us the volume of the solids the graph of / bounds above the xy-plane 


J J D 

over the closed, bounded region D minus the volume of the solids the graph of / bounds 
below the xy-plane under the region D; 


• ~A7~n\ / / /(-A v) where .4(74) is the area of D tells us the average value of the function 
A(Dj J J R 

f on D. If f(x, y) > 0 on D, we can interpret this average value of / on D as the height of 
the solid with base D and constant cross-sectional area D that has the same volume as the 
volume of the surface defined by / over D. 


Activity 11.6. 

Consider the double integral 
(0,0), (4,0), and (0,2). 


D 


(4 — x — 2 y) dA, where D is the triangular region with vertices 


(a) Write the given integral as an iterated integral of the form J J (4 — x — 2 y) dy dx. Draw 
a labeled picture of D with relevant cross sections. 

(b) Write the given integral as an iterated integral of the form j J (4 — x — 2 y) dx dy. Draw 
a labeled picture of D with relevant cross sections. 















11.3. DOUBLE INTEGRALS OVER GENERAL REGIONS 


205 


(c) Evaluate the two iterated integrals from (a) and (b), and verify that they produce the 
same value. Give at least one interpretation of the meaning of your result. 


<1 


Activity 11.7. 

Consider the iterated integral 


fx=5 ry=x 2 


ry=x 

/ (Ax + lOy) dy dx. 
J v=—x 


Jx =3 Jy=—x 

(a) Sketch the region of integration, D, for which 


(Ax + lOy) dA = 


rx=5 ry=x 2 


D 


Jx =3 Jy——x 


(Ax + lOy) dydx. 


(b) Determine the equivalent iterated integral that results from integrating in the opposite 
order (dx dy, instead of dy dx). That is, determine the limits of integration for which 


D 


ry=l px=? 

(Ax + 10y) dA= / (Ax + lOy) dx dy. 

Jy=? Jx=l 


(c) Evaluate one of the two iterated integrals above. Explain what the value you obtained 
tells you. 

(d) Set up and evaluate a single definite integral to determine the exact area of D, A(D). 

(e) Determine the exact average value of f(x, y) = Ax + lOy over D. 




Activity 11.8. 

rX=A p 

Consider the iterated integral / / 

J x=0 J v 


x=4 py =2 


e v dydx. 


x=0 Jy=x/2 

2 

(a) Explain why we cannot antidifferentiate e y with respect to y, and thus are unable to 

rx =4 ry =2 2 

evaluate the iterated integral / / e ?r dy dx using the Fundamental Theorem of 

J x =0 Jy=x/2 

Calculus. 


(b) Sketch the region of integration, D, so that j j i 


dA = 


rx =4 py=2 


I x =0 Jy=x/2 


e y dydx. 


(c) Rewrite the given iterated integral in the opposite order, using dA = dx dy. 

(d) Use the Fundamental Theorem of Calculus to evaluate the iterated integral you devel¬ 
oped in (d). Write one sentence to explain the meaning of the value you found. 

(e) What is the important lesson this activity offers regarding the order in which we set up 
an iterated integral? 


@®@© 
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Summary 


For a double integral J J f(x. y) dA over a non-rectangular region D, we enclose D in a rectan¬ 
gle R and then extend integrand / to a function F so that F(x, y) = 0 at all points in R outside 
of D and F(x, y) = f(x, y) for all points in D. We then define JJ f(x, y) dA to be equal to 

F(x,y)dA. 


R 


• In an iterated double integral, the limits on the outer integral must be constants while the limits 
on the inner integral must be constants or in terms of only the remaining variable. In other 
words, an iterated double integral has one of the following forms (which result in the same 
value): 


rx=b ry=g 2 (x) 
J x=a Jy=gi(x) 


f(x,y) dy dx, 


where g\ = g\ (x) and ()■) = g-Ax) are functions of x only and the region D is described by the 
inequalities gi(x) < y < (x) and a < x < b or 


ry=d rx=h 2 (y) 
' y=c Jx=hi(y) 


f(x,y)dxdy , 


where hi = In (y) and hx = h‘i (y) are functions of y only and the region D is described by the 
inequalities h\{y) < x < h 2 {y) and c < y < d. 


Exercises 


1. For each of the following iterated integrals, (a) sketch the region of integration, (b) write an 
equivalent iterated integral expression in the opposite order of integration, and (c) choose one 
of the two orders and evaluate the integral. 


rx=l ry=x 

(a) / / xy dy dx 

J x=0 J y=x 2 

ry =2 rx=0 

(b) / / ,_ xy dx dy 

J y= 0 J X- 


= 0 Jx = -y/4^1 


rx=l py=x 1 / 4: 

(c) / / x + ydydx 

J x=0 J v- 


(d) 


r=0 J y=x 4 

ry =2 rx=2y 


'y= 0 Jx=y/2 


x + ydxdy 


@©®@ 
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2. The temperature at any point on a metal plate in the xy plane is given by T(x,y) = 100—4.x 2 —y 2 , 
where x and y are measured in inches and T in degrees Celsius. Consider the portion of the 
plate that lies on the region D that is the finite region that lies between the parabolas x = y 2 

and x = 3 — 2 y 2 . 

(a) Construct a labeled sketch of the region D. 

(b) Set up an integrated integral whose value is ff D T(x, y) dA. 

(c) Set up an integrated integral whose value is f j D T(x. y) dA. 

(d) Use the Fundamental Theorem of Calculus to evaluate the integrals you determined in 

(b) and (c). 

(e) Determine the exact average temperature, T AVG(D) , over the region D. 

3. Consider the solid that is given by the following description: the base is the given region D, 
while the top is given by the surface z = p(x,y). In each setting below, set up, but do not 
evaluate, an iterated integral whose value is the exact volume of the solid. Include a labeled 
sketch of D in each case. 

(a) D is the interior of the quarter circle of radius 2, centered at the origin, that lies in the 
second quadrant of the plane; p(x, y) = 16 — x 2 — y 2 . 

(b) D is the finite region between the line y = x + 1 and the parabola y = x 2 ; 

P(x,y) = 10 - x - 2y. 

(c) D is the triangular region with vertices (1,1), (2, 2), and (2, 3); p(x, y) = e~ xy . 

(d) D is the region bounded by the y-axis, y = 4 and x = p(x, y) = >/l + x 2 + y 2 . 

rx=A ry =2 

4. Consider the iterated integral I = / / cos (y 3 )dydx. 

J x= 0 J y=\fx 

(a) Sketch the region of integration. 

(b) Write an equivalent iterated integral with the order of integration reversed. 

(c) Choose one of the two orders of integration and evaluate the iterated integral you chose 
by hand. Explain the reasoning behind your choice. 

(d) Determine the exact average value of cos(y 3 ) over the region I) that is determined by 
the iterated integral I. 
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11.4 Applications of Double Integrals 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 


• If we have a mass density function for a lamina (thin plate), how does a double integral 
determine the mass of the lamina? 

• How may a double integral be used to find the area between two curves? 

• Given a mass density function on a lamina, how can we find the lamina's center of mass? 

• What is a joint probability density function? How do we determine the probability of an 
event if we know a probability density function? 


Introduction 

So far, we have interpreted the double integral of a function / over a domain D in two different 
ways. First, ff D fix, y) dA tells us a difference of volumes - the volume the surface defined by 
/ bounds above the xy-plane on D minus the volume the surface bounds below the .xy-plane on 
D. In addition, / Jo / (-A v) dA determines the average value of / on I). In this section, we 

investigate several other applications of double integrals, using the integration process as seen in 
Preview Activity 11.4: we partition into small regions, approximate the desired quantity on each 
small region, then use the integral to sum these values exactly in the limit. 

The following preview activity explores how a double integral can be used to determine the 
density of a thin plate with a mass density distribution. Recall that in single-variable calculus, 
we considered a similar problem and computed the mass of a one-dimensional rod with a mass- 
density distribution. There, as here, the key idea is that if density is constant, mass is the product 
of density and volume. 

Preview Activity 11.4. Suppose that we have a flat, thin object (called a lamina) whose density 
varies across the object. We can think of the density on a lamina as a measure of mass per unit 
area. As an example, consider a circular plate D of radius 1 cm centered at the origin whose 
density 5 varies depending on the distance from its center so that the density in grams per square 
centimeter at point (x, y) is 

6(x,y) = 10 - 2 (x 2 + y 2 ). 


(a) Suppose that we partition the plate into subrectangles Rij, where 1 < i < m and 1 < j < n, 
of equal area A A, and select a point (xf, y*f) in R, VJ for each i and j. 

What is the meaning of the quantity 5(x*-, y*-)AA? 

(b) State a double Riemann sum that provides an approximation of the mass of the plate. 
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(c) Explain why the double integral 


tells us the exact mass of the plate. 



6(x, y) dA 


(d) Determine an iterated integral which, if evaluated, would give the exact mass of the plate. 
Do not actually evaluate the integral. 2 


IX 


Mass 

Density is a measure of some quantity per unit area or volume. For example, we can measure the 
human population density of some region as the number of humans in that region divided by the 
area of that region. In physics, the mass density of an object is the mass of the object per unit area 
or volume. As suggested by Preview Activity 11.4, the following holds in general. 

If S(x, y) describes the density of a lamina defined by a planar region D, then the mass of D is 
given by the double integral JJ 5(x, y) dA. 


Activity 11.9. 

Let D be a half-disk lamina of radius 3 in quadrants IV and I, centered at the origin as shown 
in Figure 11.18. Assume the density at point (x, y) is given by S(x, y) = x. Find the exact mass 
of the lamina. 



Figure 11.18: A half disk lamina. 


<1 


2 This integral is considerably easier to evaluate in polar coordinates, which we will learn more about in Section 11.5. 


@©@© 
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Area 

If we consider the situation where the mass-density distribution is constant, we can also see how 
a double integral may be used to determine the area of a region. Assuming that 5(x,y) = 1 over a 
closed bounded region D, where the units of 6 are "mass per unit of area," it follows that ff D 1 dA 
is the mass of the lamina. But since the density is constant, the numerical value of the integral is 
simply the area. 

As the following activity demonstrates, we can also see this fact by considering a three-dimensional 
solid whose height is always 1. 

Activity 11.10. 

Suppose we want to find the area of the bounded region D between the curves 

y = 1 — x 2 and y = x — 1. 

A picture of this region is shown in Figure 11.19. 

(a) We know that the volume of a solid with constant height is given by the area of the base 
times the height. Hence, we may interpret the area of the region D as the volume of a 
solid with base D and of uniform height 1. Determine a double integral whose value is 
the area of D. 



Figure 11.19: The graphs of y = 1 — x 2 and y = x — 1. 


(b) Write an iterated integral whose value equals the double integral you found in (a). 

(c) Use the Fundamental Theorem of Calculus to evaluate only the inner integral in the 
iterated integral in (b). 

(d) After completing part (c), you should see a standard single area integral from calc II. 
Evaluate this remaining integral to find the exact area of D. 


@©@© 
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We now formally state the conclusion from our earlier discussion and Activity 11.10. 


Given a closed, bounded region D in the plane, the area of D, denoted A(D), is given by the 
double integral 



IdA. 


Center of Mass 

The center of mass of an object is a point at which the object will balance perfectly. For example, 
the center of mass of a circular disk of uniform density is located at its center. For any object, if 
we throw it through the air, it will spin around its center of mass and behave as if all the mass is 
located at the center of mass. 

In order to understand the role that integrals play in determining the center of a mass of an 
object with a nonuniform mass distribution, we start by finding the center of mass of a collection 
of N distinct point-masses in the plane. 

Let mi, m 2 , ..., rriN be N masses located in the plane. Think of these masses as connected by 
rigid rods of negligible weight from some central point (x,y). A picture with four masses is shown 
in Figure 11.20. Now imagine balancing this system by placing it on a thin pole at the point (x, y) 
perpendicular to the plane containing the masses. Unless the masses are perfectly balanced, the 
system will fall off the pole. The point (x, y) at which the system will balance perfectly is called 
the center of mass of the system. Our goal is to determine the center of mass of a system of discrete 
masses, then extend this to a continuous lamina. 



Figure 11.20: A center of mass ( x , y) of four masses. 


Each mass exerts a force (called a moment) around the lines x = x and y = y that causes 
the system to tilt in the direction of the mass. These moments are dependent on the mass and 
the distance from the given line. Let (x\,yi) be the location of mass mi, (m2,2/2) the location of 
mass m 2 , etc. In order to balance perfectly, the moments in the x direction and in the y direction 
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must be in equilibrium. We determine these moments and solve the resulting system to find the 
equilibrium point (x, y) at the center of mass. 

The force that mass mi exerts to tilt the system from the line y = y is 

mig(y-yi), 

where g is the gravitational constant. Similarly, the force mass m 2 exerts to tilt the system from the 
line y = y is 

m2g(y-y2)- 

In general, the force that mass nip. exerts to tilt the system from the line y = y is 

mkg(y-yk)- 


For the system to balance, we need the forces to sum to 0, so that 


N 

^2 m k g{y - y k ) = 0. 
k =1 


Solving for y, we find that 

sr^N 

— _ Efc= 1 m kVk 

U ■sr-^N 

Efc=1 m k 

A similar argument shows that 

V-'N./V 

_ Efc=1 m k x k 

z2k =1 m k 


The value M x = Ylk= 1 711 kVk is called the total moment with respect to the x-axis; M y = J2 k =i m k x k 
is the total moment with respect to the y-axis. Hence, the respective quotients of the moments to 
the total mass, M, determines the center of mass of a point-mass system: 


(x,y) 


(My MA 
V M ’ M ) ' 


Now, suppose that rather than a point-mass system, we have a continuous lamina with a 
variable mass-density S(x, y). We may estimate its center of mass by partitioning the lamina into 
rnn subrectangles of equal area A A, and treating the resulting partitioned lamina as a point-mass 
system. In particular, we select a point (x* :) , y* ; ) in the ij th subrectangle, and observe that the 
quanity 

$( x *j,y*j) AA 

is density times area, so 5(x*j,y*j)AA approximates the mass of the small portion of the lamina 
determined by the subrectangle Rij. 

We now treat <5 (x*- , y\ 3 ) A A as a point mass at the point The coordinates (x, y) of the 

center of mass of these mn point masses are thus given by 


x = 


E"=i HT=i 5 ( x ij^ij) AA 


and 


y = 


Ej=i E”ii yij s ( x ij,y*ij) AA 

E”=iEr=i^Hv4-)AA 


@®®@ 








11.4. APPLICATIONS OF DOUBLE INTEGRALS 


213 


If we take the limit as m and n go to infinity, we obtain the exact center of mass (x. y) of the 
continuous lamina. 

The coordinates (x, y) of the center of mass of a lamina D with density 5 = S(x, y) are given 
by 

- _ IId x6 ( x > y) dA A -- SSd v s ( x > y) dA 

II D d ( x ^y) dA an y ff D s ( x ^y) dA ' 


The numerators of x and y are called the respective moments of the lamina about the coordinate 
axes. Thus, the moment of a lamina D with density 5 = 5(x, y) about the y-axis is 


My = 

and the moment of D about the x-axis is 



x5(x, y) dA 


M x 



yd(x,y) dA. 


If M is the mass of the lamina, it follows that the center of mass is (x, y) 

Activity 11.11. 


(My MA 
y M ’ M ) ' 


In this activity we determine integrals that represent the center of mass of a lamina D described 
by the triangular region bounded by the x-axis and the lines x = 1 and y = 2x in the first 
quadrant if the density at point (x, y) is 5(x, y) = 6x + (jy + 6. A picture of the lamina is shown 
in Figure 11.21. 



Figure 11.21: The lamina bounded by the x-axis and the lines x = 1 and y = 2x in the first 
quadrant. 

(a) Set up an iterated integral that represents the mass of the lamina. 

(b) Assume the mass of the lamina is 14. Set up two iterated integrals that represent the 
coordinates of the center of mass of the lamina. 


<1 
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Probability 

Calculating probabilities is a very important application of integration in the physical, social, and 
life sciences. To understand the basics, consider the game of darts in which a player throws a dart 
at a board and tries to hit a particular target. Let us suppose that a dart board is in the form of 
a disk D with radius 10 inches. If we assume that a player throws a dart at random, and is not 
aiming at any particular point, then it is equally probable that the dart will strike any single point 
on the board. For instance, the probability that the dart will strike a particular 1 square inch region 
is jy^, or the ratio of the area of the desired target to the total area of D (assuming that the dart 
thrower always hits the board itself at some point). Similarly, the probability that the dart strikes 
a point in the disk D 3 of radius 3 inches is given by the area of D 3 divided by the area of D. In 
other words, the probability that the dart strikes the disk D 3 is 

= [[ 

1007T J J Ds 1007T 

The integrand, may be thought of as a distribution function, describing how the dart strikes 
are distributed across the board. In this case the distribution function is constant since we are 
assuming a uniform distribution, but we can easily envision situations where the distribution 
function varies. For example, if the player is fairly good and is aiming for the bulls eye (the center 
of D), then the distribution function / could be skewed toward the center, say 

f(x,y) = Ke~^ + ^ 


for some constant positive K. If we assume that the player is consistent enough so that the dart 
always strikes the board, then the probability that the dart strikes the board somewhere is 1, and 
the distribution function / will have to satisfy 3 



f(x,y) dA = 1. 


For such a function /, the probability that the dart strikes in the disk D\ of radius 1 would be 



f(x,y) dA. 


Indeed, the probability that the dart strikes in any region R that lies within D is given by 



f(x,y) dA. 


The preceding discussion highlights the general idea behind calculating probabilities. We as¬ 
sume we have a joint probability density function f, a function of two independent variables x and 
y defined on a domain D that satisfies the conditions 

3 This makes K = ,— 1 which vou can check. 

7r(l-e- 100 ) 3 
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• f(x, y) > 0 for all x and y in D, 

• the probability that x is between some values a and b while y is between some values c and 
d is given by 

f(x,y) dy dx, 

• The probability that the point (x, y) is in I) is 1, that is 




f(x,y) dA = 1. 


( 11 . 2 ) 


Note that it is possible that D could be an infinite region and the limits on the integral in Equa¬ 
tion (11.2) could be inf in ite. When we have such a probability density function / = f(x, y), the 
probability that the point (x, y) is in some region R contained in the domain D (the notation we 
use here is "P((x, y) e R)") is determined by 

P({x, y) £ R) = JJ^ f(x, y) dA. 


Activity 11.12. 

A firm manufactures smoke detectors. Two components for the detectors come from different 
suppliers - one in Michigan and one in Ohio. The company studies these components for 
their reliability and their data suggests that if x is the life span (in years) of a randomly chosen 
component from the Michigan supplier and y the life span (in years) of a randomly chosen 
component from the Ohio supplier, then the joint probability density function / might be given 
by 

f(x,y) = e x e y . 


(a) Theoretically, the components might last forever, so the domain D of the function / is 
the set D of all (x\ y) such that x > 0 and y > 0. To show that / is a probability density 
function on D we need to demonstrate that 

/ J D ^ X,y ^ dA = 1; 

or that 

POO POO 

/ / f(x,y)dydx = 1. 

Jo Jo 

Use your knowledge of improper integrals to verify that / is indeed a probability den¬ 
sity function. 

(b) Assume that the smoke detector fails only if both of the supplied components fail. To 
determine the probability that a randomly selected detector will fail within one year, we 
will need to determine the probability that the life span of each component is between 
0 and 1 years. Set up an appropriate iterated integral, and evaluate the integral to 
determine the probability. 
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(c) What is the probability that a randomly chosen smoke detector will fail between years 
3 and 7? 

(d) Suppose that the manufacturer determines that one of the components is more likely to 
fail than the other, and hence conjectures that the probability density function is instead 
f(x, y) = Ke~ x e~ 2y . What is the value of K? 


< 


Summary 

• The mass of a lamina D with a mass density function 5 = S(x, y) is / f D S(x, y) dA. 

• The area of a region D in the plane has the same numerical value as the volume of a solid of 
uniform height 1 and base D, so the area of D is given by ff D 1 dA. 

• The center of mass, (x, y), of a continuous lamina with a variable density S(x, y) is given by 

= f f D xS(x, y) dA __ Jf D y5(x, y) dA 

IId ‘Hu v) d A an y ff D d(x, y) dA 

• Given a joint probability density function / is a function of two independent variables x and y 
defined on a domain D, if R is some subregion of D, then the probability that (x. y) is in R is 
given by 

JJ R f(x,y)dA. 


Exercises 

1. A triangular plate is bounded by the graphs of the equations y = 2x, y = Ax, and y = 4. The 
plate's density at (x, y) is given by 6(x. y) = 4 xy 2 +1, measured in grams per square centimeter 
(and x and y are measured in centimeters). 

(a) Set up an iterated integral whose value is the mass of the plate. Include a labeled sketch 
of the region of integration. Why did you choose the order of integration you did? 

(b) Determine the mass of the plate. 

(c) Determine the exact center of mass of the plate. Draw and label the point you find on 
your sketch from (a). 

(d) What is the average density of the plate? Include units on your answer. 

2. Let D be a half-disk lamina of radius 3 in quadrants IV and I, centered at the origin as in Activity 
11.9. Assume the density at point (x, y) is equal to x. 

(a) Before doing any calculations, what do you expect the y-coordinate of the center of mass 
to be? Why? 
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(b) Set up iterated integral expressions which, if evaluated, will determine the exact center 
of mass of the lamina. 

(c) Use appropriate technology to evaluate the integrals to find the center of mass numeri¬ 
cally. 

3. Let x denote the time (in minutes) that a person spends waiting in a checkout line at a grocery 
store and y the time (in minutes) that it takes to check out. Suppose the joint probability density 
for x and y is 

f(x,y) = i e -*/ 4 -V 2 . 

(a) What is the exact probability that a person spends between 0 to 5 minutes waiting in 
line, and then 0 to 5 minutes waiting to check out? 

(b) Set up, but do not evaluate, an iterated integral whose value determines the exact prob¬ 
ability that a person spends at most 10 minutes total both waiting in line and checking 
out at this grocery store. 

(c) Set up, but do not evaluate, an iterated integral expression whose value determines the 
exact probability that a person spends at least 10 minutes total both waiting in line and 
checking out, but not more than 20 minutes. 
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11.5 Double Integrals in Polar Coordinates 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• What are the polar coordinates of a point in two-space? 

• How do we convert between polar coordinates and rectangular coordinates? 

• What is the area element in polar coordinates? 

• How do we convert a double integral in rectangular coordinates to a double integral in 
polar coordinates? 


Introduction 


While we have naturally defined double integrals in the rectangular coordinate system, starting 
with domains that are rectangular regions, there are many of these integrals that are difficult, 
if not impossible, to evaluate. For example, consider the domain D that is the unit circle and 
f(x, y) = e~ x ~ y . To integrate / over D, we would use the iterated integral 



f(x,y) dA 



ry=V 1—x' 2 


t/=—s/1—a 


,-x 2 -y 2 


dy dx. 


For this particular integral, regardless of the order of integration, we are unable to find an an¬ 
tiderivative of the integrand; in addition, even if we were able to find an antiderivative, the inner 
limits of integration involve relatively complicated functions. 

It is useful, therefore, to be able to translate to other coordinate systems where the limits of 
integration and evaluation of the involved integrals is simpler. In this section we provide a quick 
discussion of one such system - polar coordinates - and then introduce and investigate their ram¬ 
ifications for double integrals. The rectangular coordinate system allows us to consider domains 
and graphs relative to a rectangular grid. The polar coordinate system is an alternate coordinate 
system that allows us to consider domains less suited to rectangular coordinates, such as circles. 

Preview Activity 11.5. The coordinates of a point determine its location. In particular, the rectan¬ 
gular coordinates of a point P are given by an ordered pair (x, y), where x is the (signed) distance 
the point lies from the y-axis to P and y is the (signed) distance the point lies from the rc-axis to P. 
In polar coordinates, we locate the point by considering the distance the point lies from the origin, 
(0,0), and the angle the line segment from the origin to P forms with the positive rc-axis. 


(a) Determine the rectangular coordinates of the following points: 

i. The point P that lies 1 unit from the origin on the positive .i-axis. 
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ii. The point Q that lies 2 units from the origin and such that OQ makes an angle of | 
with the positive x-axis, where O is the origin, (0,0). 

iii. The point R that lies 3 units from the origin such that OR makes an angle of ^ with 
the positive x-axis. 

(b) Part (a) indicates that the two pieces of information completely determine the location 
of a point: either the traditional (x, y) coordinates, or alternately, the distance r from the 
point to the origin along with the angle 9 that the line through the origin and the point 
makes with the positive x-axis. We write "(r, 6)” to denote the point's location in its polar 
coordinate representation. Find polar coordinates for the points with the given rectangular 
coordinates. 

i. (0,-1) ii-(-2,0) hi-(-1,1) 

(c) For each of the following points whose coordinates are given in polar form, determine the 
rectangular coordinates of the point. 

i. (5, f) ii. (2, f) iii. (V3, f) 


CXI 


A Quick Overview of Polar Coordinates 

The rectangular coordinate system is best suited for graphs and regions that are naturally con¬ 
sidered over a rectangular grid. The polar coordinate system is an alternative that offers good 
options for functions and domains that have more circular characteristics. A point P in rectan¬ 
gular coordinates that is described by an ordered pair (x, y ), where x is the displacement from 
P to the y-axis and y is the displacement from P to the x-axis, as seen in Preview Activity 11.5, 
can also be described with polar coordinates (r, 9), where r is the distance from P to the origin 
and 9 is the angle formed by the line segment OP and the positive x-axis, as shown in Figure 
11.22. Trigonometry and the Pythagorean Theorem allow for straightforward conversion from 
rectangular to polar, and vice versa. 


Converting from rectangular to polar. If we are given the rectangular coordinates (x, y) of a 
point P , then the polar coordinates (r, 9) of P satisfy 

r = J x 2 + y 2 and tan(0) = —, assuming x / 0. 

x 

Converting from polar to rectangular. If we are given the polar coordinates (r, 9) of a point 
P, then the rectangular coordinates (x, y) of P satisfy 

x = rcos($) and y = rsin(9). 


@©@© 






220 


11.5. DOUBLE INTEGRALS IN POLAR COORDINATES 



Figure 11.22: The polar coordinates of 
a point. 


?r/2 



3tt/2 


Figure 11.23: The polar coordinate 
grid. 


We can draw graphs of curves in polar coordinates in a similar way to how we do in rect¬ 
angular coordinates. However, when plotting in polar coordinates, we use a grid that considers 
changes in angles and changes in distance from the origin. In particular, the angles 9 and distances 
r partition the plane into small wedges as shown in Figure 11.23. 

Activity 11.13. 

Most polar graphing devices 4 can plot curves in polar coordinates of the form r = f(d). Use 
such a device to complete this activity. 

(a) Before plotting the polar curve r = 1, think about what shape it should have, in light of 
how r is connected to x and y. Then use appropriate technology to draw the graph and 
test your intuition. 

(b) The equation 9 = 1 does not define r as a function of 9, so we can't graph this equation 
on many polar plotters. What do you think the graph of the polar curve 9 = 1 looks 
like? Why? 

(c) Before plotting the polar curve r = 9, what do you think the graph looks like? Why? 
Use technology to plot the curve and compare your intuition. 

(d) What about the curve r = sin(0)? After plotting this curve, experiment with others of 
your choosing and think about why the curves look the way they do. 


<1 

4 You can use your calculator in POL mode, or a web applet such as http : //webspace . ship . edu/msrenault/ 
ggb/polar_grapher . html 
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Integrating in Polar Coordinates 


Consider the double integral 



* 2+y2 dA, 


where D is the unit disk. While we cannot directly evaluate this integral in rectangular coordi¬ 
nates, a change to polar coordinates will convert it to one we can easily evaluate. 


We have seen how to evaluate a double integral 
form 



f(x,y) dA as an iterated integral of the 



f(x, y) dy dx 


in rectangular coordinates, because we know that dA = dy dx in rectangular coordinates. To make 
the change to polar coordinates, we not only need to represent the variables x and y in polar 
coordinates, but we also must understand how to write the area element, dA, in polar coordinates. 
That is, we must determine how the area element dA can be written in terms of dr and d6 in the 
context of polar coordinates. We address this question in the following activity. 



Figure 11.24: A polar rectangle. 


Figure 11.25: An annulus. 


Activity 11.14. 

Consider a polar rectangle R, with r between r, and r l+ \ and 9 between 8j and 9j + 1 as shown 
in Figure 11.24. Let Ar = r ? ; + 1 — r, and A 9 = 0 J+ \ — 9 r Let A A be the area of this region. 

(a) Explain why the area A A in polar coordinates is not Ar A 9. 

(b) Now find A A by the following steps: 

i. Find the area of the annulus (the washer-like region) between r, and r l+ \, as shown 
at right in Figure 11.25. This area will be in terms of r t and r. l+ \. 
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ii. 


Observe that the region R is only a portion of the annulus, so the area A/1 of R is 
only a fraction of the area of the annulus. For instance, if 9i + \ — 9{ were |, then the 
resulting wedge would be 


n 


4 

2t r 


1 

8 


of the entire annulus. In this more general context, using the wedge between the 
two noted angles, what fraction of the area of the annulus is the area A A? 

iii. Write an expression for A A in terms of r„ r l+ ], 9j, and 9 J+ \. 

iv. Finally, write the area A A in terms of r„ r l+ \, A r, and A9, where each quantity 
appears only once in the expression. (Hint: Think about how to factor a difference 
of squares.) 


(c) As we take the limit as Ar and A 9 go to 0, A r becomes dr, A 9 becomes d9, and A A 
becomes dA, the area element. Using your work in (iv), write dA in terms of r, dr, and 


d9. 


<1 

From the result of Activity 11.14, we see when we convert an integral from rectangular coordi¬ 
nates to polar coordinates, we must not only convert x and y to being in terms of r and 9, but we 
also have to change the area element to dA = r dr d9 in polar coordinates. In other words, given 
a double integral f f /) fix, y) dA in rectangular coordinates, to write a corresponding iterated inte¬ 
gral in polar coordinates, we replace x with r cos (9), y with r sin($) and dA with r dr d9. Of course, 
we need to describe the region D in polar coordinates as well. To summarize: 

The double integral ff D fix, y) dA in rectangular coordinates can be converted to a double 
integral in polar coordinates as f f D f(r co s(9),r sin(0)) r dr d9. 


Example 11.2. Let f(x,y ) = e x2+y2 on the disk D = {(x,y) : x 2 + y 2 < 1}. We will evaluate 

SSd /(*» y) dA - 


In rectangular coordinates the double integral 
integral 



f(x, y) dA can be written as the iterated 



f(x,y) dA 



ry=V 1—x 2 

J y=—\/l—x 2 


dy dx. 


We cannot evaluate this iterated integral, because e x2+y ~ does not have an elementary antideriva¬ 
tive with respect to either x or y. However, since r 2 = x 2 + y 2 and the region D is circular, it 
is natural to wonder whether converting to polar coordinates will allow us to evaluate the new 
integral. To do so, we replace x with r cos (9), y with r sin(0), and dy dx with r dr d9 to obtain 


D 


f(x,y)dA= e r rdrd9. 


D 


@©@© 
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The disc D is described in polar coordinates by the constraints 0 < r < 1 and 0 < 6 < 2tt. 
Therefore, it follows that 


pO=2 ty pr =1 

e r2 rdrdd= / / e^rdrdO. 

D Je =o Jr= o 


We can evaluate the resulting iterated polar integral as follows: 


r6=2iT pr =1 


r* 27 T 


e r r dr d6 = 


V 


10=0 Jr =0 


/ 9=0 

1 /■0=2tt 


r= 1 N 


r =0 > 


do 


2 ln- 


0=0 


(e — 1) dO 


^- i) L 


= 2 (e-l )[*] 

= 7r(e — 1). 


0=2n 

d6 
=o 

e=27r 

e=o 


While there is no firm rule for when polar coordinates can or should be used, they are a natural 
alternative anytime the domain of integration may be expressed simply in polar form, and/or 
when the integrand involves expressions such as \Jx 2 + y 2 . 

Activity 11.15. 

Let f(x,y) = x + y and D = {(x,y) : x 2 + y 2 < 4}. 

(a) Write the double integral of / over D as an iterated integral in rectangular coordinates. 

(b) Write the double integral of / over D as an iterated integral in polar coordinates. 

(c) Evaluate one of the iterated integrals. Why is the final value you found not surprising? 



Figure 11.26: The graphs of y = x and x 2 + (y — l) 2 = 1, for use in Activity 11.16. 
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Activity 11.16. 

Consider the circle given by x 2 + (y — l) 2 = 1 as shown in Figure 11.26. 

(a) Determine a polar curve in the form r = f(9) that traces out the circle x 2 + (y — l) 2 = 1. 

(b) Find the exact average value of g(x, y ) = \J x 2 + y 2 over the interior of the circle a: 2 + 

(y- i) 2 = i- 

(c) Find the volume under the surface h(x,y) = x over the region I), where D is the region 
bounded above by the line y = x and below by the circle. 

(d) Explain why in both (b) and (c) it is advantageous to use polar coordinates. 

<1 

Summary 

• The polar representation of a point P is the ordered pair (r, 9) where r is the distance from the 
origin to P and 9 is the angle the ray through the origin and P makes with the positive x-axis. 

• The polar coordinates r and 9 of a point (x, y) in rectangular coordinates satisfy 

r = \/x 2 + y 2 and tan(0) = —; 

x 

the rectangular coordinates x and y of a point (r, 9) in polar coordinates satisfy 

x = rcos(0) and y = rsin(0). 


• The area element dA in polar coordinates is determined by the area of a slice of an annulus and 
is given by 


dA = rdrd9. 


• To convert the double integral ff D f(x,y) dA to an iterated integral in polar coordinates, we 
substitute r cos (9) for x, r sin(0) for y, and r dr d9 for dA to obtain the iterated integral 



f(r cos(9), r sin(0)) r dr d9. 


Exercises 

1. Consider the iterated integral I = 


r 0 r 0 


dx dy. 


-3 J- yj9-y 2 X 2 +V 2 + 1 

(a) Sketch (and label) the region of integration. 

(b) Convert the given iterated integral to one in polar coordinates. 

(c) Evaluate the iterated integral in (b). 
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(d) State one possible interpretation of the value you found in (c). 

2. Let D be the region that lies inside the unit circle in the plane. 

(a) Set up and evaluate an iterated integral in polar coordinates whose value is the area of 
D. 

(b) Determine the exact average value of f(x, y) = y over the upper half of D. 

(c) Find the exact center of mass of the lamina over the portion of D that lies in the first 
quadrant and has its mass density distribution given by S(x, y) = 1. (Before making 
any calculations, where do you expect the center of mass to lie? Why?) 

(d) Find the exact volume of the solid that lies under the surface z = 8 — x 2 — y 2 and over 
the unit disk, D. 


3. For each of the following iterated integrals, (a) sketch and label the region of integration, (b) 
convert the integral to the other coordinate system (if given in polar, to rectangular; if given in 
rectangular, to polar), and (c) choose one of the two iterated integrals to evaluate exactly 


(a) 

(b) 

(c) 

(d) 


/*37r/2 p3 

/ / r 3 drd0 

lit Jo 

r 2 /yi-oz-i) 2 


y / x 2 + y 2 dy dx 

/o J-y/ l-U-i ) 2 

j-n/2 n sin(0) - 

/ / ry 1 — r 2 dr d6. 

Jo Jo 


ry/2/2 ry/l-y 2 


cos(.x 2 + y 2 ) dx dy. 


' 0 Jy 
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11.6 Surfaces Defined Parametrically and Surface Area 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• What is a parameterization of a surface? 

• How do we find the surface area of a parametrically defined surface? 


Introduction 

We have now studied at length how curves in space can be defined parametrically by functions 
of the form r(t) = (x(t), y(t), z{t)), and surfaces can be represented by functions 2 = f(x, y). In 
what follows, we will see how we can also define surfaces parametrically A one-dimensional 
curve in space results from a vector function that relies upon one parameter, so a two-dimensional 
surface naturally involves the use of two parameters. If x = x(s, t), y = y(s, t), and z = z(s, t ) are 
functions of independent parameters s and t, then the terminal points of all vectors of the form 

r(s, t) = x(s, t) i + y(s, t) j + z(s, t) k 

form a surface in space. The equations x = x(s,t), y = y(s,t), and z = z(s,t) are the parametric 
equations for the surface, or a parametrization of the surface. In Preview Activity 11.6 we investigate 
how to parameterize a cylinder and a cone. 

Preview Activity 11.6. Recall the standard parameterization of the unit circle that is given by 

x{t) = cos (t) and y(t) = sin(t), 


where 0 < t < 2n. 

(a) Determine a parameterization of the circle of radius 1 in M 3 that has its center at (0,0,1) 
and lies in the plane z = 1. 

(b) Determine a parameterization of the circle of radius 1 in 3-space that has its center at 
(0,0, —1) and lies in the plane z = — 1. 

(c) Determine a parameterization of the circle of radius 1 in 3-space that has its center at 
(0,0, 5) and lies in the plane z = 5. 

(d) Taking into account your responses in (a), (b), and (c), describe the graph that results from 
the set of parametric equations 

x(s, t) = cos(f), y(s, t) = sin(t), and z(s,t) = s, 

where 0 < t < 2 -k and — 5 < s < 5. Explain your thinking. 
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(e) Just as a cylinder can be viewed as a "stack" of circles of constant radius, a cone can be 
viewed as a stack of circles with varying radius. Modify the parametrizations of the circles 
above in order to construct the parameterization of a cone whose vertex lies at the origin, 
whose base radius is 4, and whose height is 3, where the base of the cone lies in the plane 
2 = 3. Use appropriate technology 5 to plot the parametric equations you develop. (Hint: 
The cross sections parallel to the xz plane are circles, with the radii varying linearly as 2 
increases.) 


CXI 


Parametric Surfaces 

In a single-variable setting, any function may have its graph expressed parametrically. For in¬ 
stance, given y = g(x), by considering the parameterization ( t,g(t )) (where t belongs to the do¬ 
main of g), we generate the same curve. What is more important is that certain curves that are not 
functions may be represented parametrically; for instance, the circle (which cannot be represented 
by a single function) can be parameterized by (cos(f), sin(f)), where 0 < t < 2 tt. 

In the same way, in a two-variable setting, the surface 2 = f(x, y ) may be expressed paramet¬ 
rically by considering 

(x(s,t),y(s,t),z(s,t)) = (s,t,f(s,t)), 

where (s, t ) varies over the entire domain of /. Therefore, any familiar surface that we have stud¬ 
ied so far can be generated as a parametric surface. But what is more powerful is that there are 
surfaces that cannot be generated by a single function 2 = f(x,y) (such as the unit sphere), but 
that can be represented parametrically. We now consider an important example. 

Example 11.3. Consider the torus (or doughnut) shown in Figure 11.27. 



Figure 11.27: A torus 


To find a parametrization of this torus, we recall our work in Preview Activity 11.6. There, 
we saw that a circle of radius r that has its center at the point (0, 0, 20 ) and is contained in the 

5 e.g., http :/ /www . flashandmath . com/mathlets/multicalc/paramrec/surf_graph_rectan . html 
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horizontal plane z = zq, as shown in Figure 11.28, can be parametrized using the vector-valued 
function r defined by 

r (t) = r cos(f )i + r sin(f)j + zok 


where 0 < t < 2 tt. 



Figure 11.28: A circle in a horizontal plane centered at (0,0, zq). 


To obtain the torus in Figure 11.27, we begin with a circle of radius a in the xz- plane centered 
at (6, 0), as shown on the left of Figure 11.29. We may parametrize the points on this circle, using 
the parameter s, by using the equations 

x(s) = b + acos(s) and z(s) = asin(s), 

where 0 < s < 2ir. 




Figure 11.29: Revolving a circle to obtain a torus. 

Let's focus our attention on one point on this circle, such as the indicated point, which has 
coordinates (x(s), 0, z(s)) for a fixed value of the parameter s. When this point is revolved about 
the z-axis, we obtain a circle contained in a horizontal plane centered at (0, 0, z(s)) and having 
radius x(s), as shown on the right of Figure 11.29. If we let t be the new parameter that generates 
the circle for the rotation about the z-axis, this circle may be parametrized by 
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r(s, t) = x(s) cos(f)i + x(s) sin(f)j + z(s)k. 
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Now using our earlier parametric equations for x(s) and z(s) for the original smaller circle, we 
have an overall parameterization of the torus given by 

r(s, t) = (b + a cos(s)) cos(f)i + (b + a cos(s)) sin(i)j + a sin(s)k. 

To trace out the entire torus, we require that the parameters vary through the values 0 < s < 2ir 
and 0 < t < 27r. 


Activity 11.17. 

In this activity, we seek a parametrization of the sphere of radius R centered at the origin, 
as shown on the left in Figure 11.30. Notice that this sphere may be obtained by revolving a 
half-circle contained in the xz -plane about the z-axis, as shown on the right. 



Figure 11.30: A sphere obtained by revolving a half-circle. 

(a) Begin by writing a parametrization of this half-circle using the parameter s: 

x(s) = ..., z(s) = .... 

Be sure to state the domain of the parameter s. 

(b) By revolving the points on this half-circle about the z-axis, obtain a parametrization 
r(s, t) of the points on the sphere of radius R. Be sure to include the domain of both 
parameters s and t. (Hint: What is the radius of the circle obtained when revolving a 
point on the half-circle around the z axis?) 

(c) Draw the surface defined by your parameterization with appropriate technology 6 . 


<1 


The Surface Area of Parametrically Defined Surfaces 

Recall that a differentiable function is locally linear - that is, if we zoom in on the surface around 
a point, the surface looks like its tangent plane. We now exploit this idea in order to determine 

6 e.g., http : //web . monroecc . edu/manila/webfiles/calcNSF/JavaCode/CalcPlot3D . htm or http : / / 
www . flashandmath . com/mathlets/multicalc/paranirec/surf_graph_rectan . html 
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the surface area generated by a parametrization (x(s, t),y(s, t). z(s , t)). The basic idea is a familiar 
one: we will subdivide the surface into small pieces, in the approximate shape of small parallel¬ 
ograms, and thus estimate the entire the surface area by adding the areas of these approximation 
parallelograms. Ultimately, we use an integral to sum these approximations and determine the 
exact surface area. 

Let 

r(s, t) = x(s, t) i + y(s, t) j + z(s, t)k 

define a surface over a rectangular domain a < s < b and c < t < d. As a function of two variables, 
s and t, it is natural to consider the two partial derivatives of the vector-valued function r, which 
we define by 

r s (s,t) = x s (s,t)i + y s (s,t)j + z s (s,t)k and r t (s,t) = x t (s,t) i + y t (s,t) j + zt(s,t) k. 

In the usual way, we slice the domain into small rectangles. In particular, we partition the interval 
[a, b] into m subintervals of length As = b ~ a and let so, s\, ..., s m be the endpoints of these 
subintervals, where a = so < s\ < S 2 <■■■< s m = b. Also partition the interval [c, d] into n 
subintervals of equal length At = ,l n c and let to, ti, ..., t n be the endpoints of these subintervals, 
where c = to < ti < t -2 < ■ ■ ■ < t n = d. These two partitions create a partition of the rectangle 
R = [a, b] x [c,d] in st-coordinates into mn sub-rectangles R, tJ with opposite vertices (sj_i, tj-i) 
and ( Si,tj ) for i between 1 and m and j between 1 and n. These rectangles all have equal area 
AA = As • At. 

Now we want to think about the small piece of area on the surface itself that lies above one of 
these small rectangles in the domain. Observe that if we increase s by a small amount As from 
the point (sj_i, tj-i) in the domain, then r changes by approximately r s (sj_i, fj_i)As. Similarly, 
if we increase t by a small amount At from the point (sj_i, tj-i), then r changes by approximately 
r t(si_i,tj_i)Af. So we can approximate the surface defined by r on the st-rectangle [s,_i. s,] x 
[tj-i,tj] with the parallelogram determined by the vectors r s (sj_i, tj-i)As and r/,(s,;_i, i)At, 
as seen in Figure 11.31. 



Figure 11.31: Approximation surface area with a parallelogram. 

Say that the small parallelogram has area S{j. If we can find its area, then all that remains is to 
sum the areas of all of the generated parallelograms and take a limit. Recall from our earlier work 
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in the course that given two vectors u and v, the area of the parallelogram spanned by u and v is 
given by the magnitude of their cross product, |u x v|. In the present context, it follows that the 
area, S t j, of the parallelogram determined by the vectors r a (sj_i, tj-i)As and tj-\)At is 

Sij = |(r s (sj_i,tj_i)As) x (r t (si-utj-i)At)\ = x r t (si-i,tj-i)\AsAt, (11.3) 

where the latter equality holds from standard properties of the cross product and length. 

We sum the surface area approximations from Equation (11.3) over all sub-rectangles to obtain 
an estimate for the total surface area, S, given by 

m n 

\r s (si-i,tj-i) x r t (si_i,^_i)|AsAh 

i= 1 j =1 

Taking the limit as m, n —> oo shows that the surface area of the surface defined by r over the 
domain D is given as follows. 

Let r(s, t ) = (x(s, t), y(s, t ), z(s, t)) be a parameterization of a smooth surface over a domain 
D. The area of the surface defined by r on D is given by 

S = JJ |r s x r f | dA. (11.4) 


Activity 11.18. 

Consider the cylinder with radius a and height h defined parametrically by 

r(s, t) = a cos(s)i + a sin(s)j + tk 
for 0 < s < 2 tt and 0 < t < h, as shown in Figure 11.32. 



Figure 11.32: A cylinder. 
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(a) Set up an iterated integral to determine the surface area of this cylinder. 

(b) Evaluate the iterated integral. 

(c) Recall that one way to think about the surface area of a cylinder is to cut the cylinder 
horizontally and find the perimeter of the resulting cross sectional circle, then multi¬ 
ply by the height. Calculate the surface area of the given cylinder using this alternate 
approach, and compare your work in (b). 

<1 

As we noted earlier, we can take any surface z = f(x, y) and generate a corresponding pa¬ 
rameterization for the surface by writing (s, t. f(s , t)). Hence, we can use our recent work with 
parametrically defined surfaces to find the surface area that is generated by a function / = f(x,y) 
over a given domain. 

Activity 11.19. 

Let z = f(x,y) define a smooth surface, and consider the corresponding parameterization 

r (s, t) = (s,t,f(s,t)). 

(a) Let D be a region in the domain of /. Using Equation 11.4, show that the area, S, of the 
surface defined by the graph of / over D is 

S = jj \jifx(x,y)) 2 + ( f y {x,y )) 2 + 1 dA. 

(b) Use the formula developed in (a) to calculate the area of the surface defined by f(x, y) = 

x 2 over the rectangle D = [—2,2] x [0,3]. 

(c) Observe that the surface of the solid describe in (b) is half of a circular cylinder. Use 
the standard formula for the surface area of a cylinder to calculate the surface area in a 
different way, and compare your result from (b). 


<1 


Summary 

• A parameterization of a curve describes the coordinates of a point on the curve in terms of a 
single parameter t, while a parameterization of a surface describes the coordinates of points on 
the surface in terms of two independent parameters. 

• If r (s, t ) = (x(s, t),y(s, t),z(s, t)) describes a smooth surface in 3-space on a domain D, then the 
area, S, of that surface is given by 


S 



x rt| dA. 
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Exercises 

1. Consider the ellipsoid given by the equation 


x 2 y 2 z 2 

— + — + — = 1 . 
16 25 9 


In Activity 11.17, we found that a parameterization of the sphere S of radius R centered at the 
origin is 

x(r,s) = Rcos(s) cos(f), y(s,t ) = Hcos(s) sin(t), and z(s,t) = Rsm(s) 
for — % < s < f and 0 < t < 2ir. 

(a) Let (x, y, z ) be a point on the ellipsoid and let X = | ,Y= |, and Z = §. Show that 
(A, Y., Z) lies on the sphere S. Hence, find a parameterization of S in terms of X, Y, and 
Z as functions of s and t. 

(b) Use the result of part (a) to find a parameterization of the ellipse in terms of x, y, and 
z as functions of s and t. Check your parametrization by substituting x, y, and z into 
the equation of the ellipsoid. Then check your work by plotting the surface defined by 
your parameterization with appropriate technology 7 . 


2. In this exercise, we explore how to use a parametrization and iterated integral to determine the 
surface area of a sphere. 

(a) Set up an iterated integral whose value is the portion of the surface area of a sphere of 
radius R that lies in the first octant (see the parameterization you developed in Activity 
11.17). 

(b) Then, evaluate the integral to calculate the surface area of this portion of the sphere. 

(c) By what constant must you multiply the value determined in (b) in order to find the 
total surface area of the entire sphere. 

(d) Finally, compare your result to the standard formula for the surface area of sphere. 

3. Consider the plane generated by z = f(x, y) = 24 — 2x — 3 y over the region D = [0, 2] x [0, 3]. 

(a) Sketch a picture of the overall solid generated by the plane over the given domain. 

(b) Determine a parameterization r(s, t) for the plane over the domain D. 

(c) Use Equation 11.4 to determine the surface area generated by / over the domain D. 

(d) Observe that the vector u = (2,0, —4) points from (0, 0, 24) to (2, 0, 20) along one side 
of the surface generated by the plane / over D. Find the vector v such that u and v 
together span the parallelogram that represents the surface defined by / over D, and 
hence compute |u x v|. What do you observe about the value you find? 

7 e.g., http : //web . monroecc . edu/manila/webfiles/calcNSF/JavaCode/CalcPlot3D . htm or http : / / 
www . flashandmath . com/mathlets/multicalc/paramrec/surf_graph_rectan . html 
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4. A cone with base radius a and height h can be realized as the surface defined by z = | \Jx~ + y 2 , 
where a and h are positive. 

(a) Find a parameterization of the cone described by z = ^ \Jx 1 + y 2 . (Hint: Compare to 
the parameterization of a cylinder as seen in Activity 11.18.) 

(b) Set up an iterated integral to determine the surface area of this cone. 

(c) Evaluate the iterated integral to find a formula for the lateral surface area of a cone of 
height h and base a. 
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11.7 Triple Integrals 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 

• How are a triple Riemann sum and the corresponding triple integral of a continuous func¬ 
tion / = f(x, y, z ) defined? 

• What are two things the triple integral of a function can tell us? 


Introduction 

We have now learned that we define the double integral of a continuous function / = f(x, y) over 
a rectangle R = [a, b] x [c, d] as a limit of a double Riemann sum, and that these ideas parallel the 
single-variable integral of a function g = g(x) on an interval [a, b}. Moreover, this double integral 
has natural interpretations and applications, and can even be considered over non-rectangular 
regions, D. For instance, given a continuous function / over a region D, the average value of /, 
/avg(d), is given by 

f AVG(R) = jj d f ( X ’ y) dA, 

where A(D) is the area of D. Likewise, if S(x, y) describes a mass density function on a lamina 
over D, the mass, M, of the lamina is given by 

M = jj 5(x,y)dA. 

It is natural to wonder if it is possible to extend these ideas of double Riemann sums and 
double integrals for functions of two variables to triple Riemann sums and then triple integrals 
for functions of three variables. We begin investigating in Preview Activity 11.7. 

Preview Activity 11.7. Consider a solid piece granite in the shape of a box B = {(x,y,z) : 0 < 
x < 4,0 < y < 6, 0 < ^ < 8}, whose density varies from point to point. Let 5(x,y,z) represent 
the mass density of the piece of granite at point ( x, y, z) in kilograms per cubic meter (so we are 
measuring x, y, and z in meters). Our goal is to find the mass of this solid. 

Recall that if the density was constant, we could find the mass by multiplying the density and 
volume; since the density varies from point to point, we will use the approach we did with two- 
variable lamina problems, and slice the solid into small pieces on which the density is roughly 
constant. 

Partition the interval [0,4] into 2 subintervals of equal length, the interval [0,6] into 3 subinter¬ 
vals of equal length, and the interval [0,8] into 2 subintervals of equal length. This partitions the 
box B into sub-boxes as shown in Figure 11.33. 
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Figure 11.33: A partitioned three-dimensional domain. 


(a) Let 0 = xo < x\ < X 2 = 4 be the endpoints of the subintervals of [0,4] after partitioning. 
Draw a picture of Figure 11.33 and label these endpoints on your drawing. Do likewise 
with 0 = yo < yi < 2/2 < 2/3 = 6 and 0 = zq < z\ < Z 2 = 8 

What is the length Ax of each subinterval [x/_i, Xi] for i from 1 to 2? the length of Ay? of 
A z? " 

(b) The partitions of the intervals [0,4], [0, 6] and [0, 8] partition the box B into sub-boxes. How 
many sub-boxes are there? What is volume AV of each sub-box? 

(c) Let B l]k denote the sub-box [x,_i. x,] x [y ? _ i. y ; ] x [z k _. i, z k ], Say that we choose a point 
(x* ]k , y* jk - z*- k ) in the i,j. kth sub-box for each possible combination of i,j, k. What is the 
meaning of 5(x* jk , y* jk , z* jk )7 What physical quantity will 5{x* jk , y* jk , z* jk )AV approxi¬ 
mate? 

(d) What final step(s) would it take to determine the exact mass of the piece of granite? 


CXI 


Triple Riemann Sums and Triple Integrals 

Through the application of a mass density distribution over a three-dimensional solid. Preview 
Activity 11.7 suggests that the generalization from double Riemann sums of functions of two 
variables to triple Riemann sums of functions of three variables is natural. In the same way, so 
is the generalization from double integrals to triple integrals. By simply adding a ^-coordinate to 
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our earlier work, we can define both a triple Riemann sum and the corresponding triple integral. 

The Triple Riemann Sum. 

Definition 11.3. Let / = f(x, y, z ) be a continuous function on a box B = [a, b] x [c, d] x [r, s]. 

The triple Riemann sum of / over B is created as follows. 

• Partition the interval [a, b] into m subintervals of equal length Ax = Let xq, x \, ..., 
x m be the endpoints of these subintervals, where a = x$ < x\ < x-> < ■ ■ ■ < x m = b. Do 
likewise with the interval [c, d] using n subintervals of equal length Ay = to generate 
c = yo < yi < y 2 < ■ ■ ■ < Vn = d, and with the interval [r, s] using ('. subintervals of equal 
length Az = to have r = zo < z\ < Z 2 < ■ ■ ■ < zi = s. 

• Let Bijk be the sub-box of B with opposite vertices (xi-i,yj-i,Zk-i) and (xi, yj,Zk) for i 
between 1 and m, j between 1 and n, and k between 1 and i. The volume of each Bijk is 
AV = Ax ■ Ay ■ Az. 

• Let (x*j k , y*j k , z*- k ) be a point in box Bij k for each i, j, and k. The resulting triple Riemann 
sum for / on B is 

m n i 

i= 1 j =1 fc=l 


If f(x, y. z) represents the mass density of the box B, then, as we saw in Preview Activity 11.7, 
the triple Riemann sum approximates the total mass of the box B. In order to find the exact mass 
of the box, we need to let the number of sub-boxes increase without bound (in other words, let m, 
n, and t go to infinity); in this case, the finite sum of the mass approximations becomes the actual 
mass of the solid B. More generally, we have the following definition of the triple integral. 


The Triple Integral Over a Box. 

Definition 11.4. With following notation defined as in a triple Riemann sum, the triple inte¬ 
gral of / over B is 



f(x,y,z ) 


dV 


lim 

m,n,£^ oo 


m n £ 

EEE/(4-4>4)' m 


i =1 j =1 k =1 


As we noted earlier, if f(x, y. z) represents the density of the solid B at each point (x, y, z), then 


M = 



f(x, y, z) dV 


is the mass of B. Even more importantly, for any continuous function / over the solid B, we can 
use a triple integral to determine the average value of / over B, f AVC{B) . We note this generalization 
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of our work with functions of two variables along with several others in the following important 
boxed information. Note that each of these quantities may actually be considered over a general 
domain S in M 3 , not simply a box, B. 


• The triple integral 

v(s)= IIL idv 

represents the volume of the solid S. 

• The average value of the function / = f(x, y, x) over a solid domain S is given by 


/avG(S) — 


fix, y, z) dV, 


V(S); jjj s 

where V(S) is the volume of the solid S. 

The center of mass of the solid S with density 5 = S(x, y, z) is (x, y, z), where 

- _ Ills x g ( x > y> z ) dv - _ Ills y s i x ' y> z ) dv ^ _ Ills z 6 ( x > y» z ) dv 

m ’ s m ’ " M 


and M = j j j 5{x,y,z ) dV is the mass of the solid S. 


In the Cartesian coordinate system, the volume element dV is dz dy dx, and, as a consequence, 
a triple integral of a function / over a box B = [a, b\ x [c, d] x [r, s] in Cartesian coordinates can be 
evaluated as an iterated integral of the form 



f(x,y,z) dV 


f(x, y, z) dz dy dx. 


If we want to evaluate a triple integral as an iterated integral over a solid S that is not a box, 
then we need to describe the solid in terms of variable limits. 

Activity 11.20. 

(a) Set up and evaluate the triple integral of f(x, y,z) = x — y + 2z over the box B = 

[-2,3] x [1,4] x [0,2], 

(b) Let S be the solid cone bounded by z = \Jx 1 + y 2 and z = 3. A picture of S is shown 
at right in Figure 11.34. Our goal in what follows is to set up an iterated integral of the 
form 

rx=? ry =? pz=l 

/ / / 6(x,y, z) dz dy dx (11.5) 

J x=? J y=? J z=? 

to represent the mass of S in the setting where S(x, y, z) tells us the density of S at the 
point (x, y. z). Our particular task is to find the limits on each of the three integrals. 
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Figure 11.34: At right, the cone; at left, its projection. 


i. If we think about slicing up the solid, we can consider slicing the domain of the 
solid's projection onto the xy-plane (just as we would slice a two-dimensional re¬ 
gion in M 2 ), and then slice in the 2 -direction as well. The projection of the solid 
is onto the xy-plane is shown at left in Figure 11.34. If we decide to first slice the 
domain of the solid's projection perpendicular to the x-axis, over what range of 
constant x-values would we have to slice? 

ii. If we continue with slicing the domain, what are the limits on y on a typical slice? 
How do these depend on x? What, therefore, are the limits on the middle integral? 

iii. Finally, now that we have thought about slicing up the two-dimensional domain 
that is the projection of the cone, what are the limits on z in the innermost integral? 
Note that over any point (x, y) in the plane, a vertical slice in the 2 direction will 
involve a range of values from the cone itself to its flat top. In particular, observe 
that at least one of these limits is not constant but depends on x and y. 

iv. In conclusion, write an iterated integral of the form (11.5) that represents the mass 
of the cone S. 




Note well: When setting up iterated integrals, the limits on a given variable can be only in terms 
of the remaining variables. In addition, there are multiple different ways we can choose to set up 
such an integral. For example, two possibilities for iterated integrals that represent a triple integral 


/(x, y, z) dV over a solid S are 



rh 2 (x,y) 


hi(x,y) 

rq 2 (x,z) 

qi{x,z) 


f(x, y, z) dz dy dx 
/(x,y, z) dy dx dz 


where g\, g 2 , hi, ha, Pi, P 2 , <?i, and qa are functions of the indicated variables. There are four other 
options beyond the two stated here, since the variables x, y, and 2 can (theoretically) be arranged 
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in any order. Of course, in many circumstances, an insightful choice of variable order will make it 
easier to set up an iterated integral, just as was the case when we worked with double integrals. 

Example 11.4. Find the mass of the tetrahedron in the first octant bounded by the coordinate 
planes and the plane x + 2y + 3z = 6 if the density at point (x. y, z ) is given by S(x, y, z) = x + y + z. 
A picture of the solid tetrahedron is shown at left in Figure 11.35. 



Figure 11.35: The tetrahedron and its projection. 


We find the mass, M, of the tetrahedron by the triple integral 


M = 



S(x,y,z) 


dV, 


where S is the solid tetrahedron described above. In this example, we choose to integrate with 
respect to z first for the innermost integral. The top of the tetrahedron is given by the equation 

x + 2y + 3z = 6; 


solving for z then yields 

z= ^(6-x-2 y). 

The bottom of the tetrahedron is the xy-plane, so the limits on ,2 in the iterated integral will be 

0 < z < ^(6 — x — 2 y). 

To find the bounds on x and y we project the tetrahedron onto the xy- plane; this corresponds 
to setting z = 0 in the equation z = |(6 - x — 2 y). The resulting relation between x and y is 

x + 2y = 6. 

The right image in Figure 11.35 shows the projection of the tetrahedron onto the xy-plane. 

If we choose to integrate with respect to y for the middle integral in the iterated integral, then 
the lower limit on y is the x-axis and the upper limit is the hypotenuse of the triangle. Note that 
the hypotenuse joins the points (6, 0) and (0,3) and so has equation y = 3 — \x. Thus, the bounds 
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on y are 0 < y < 3 — \x. Finally, the x values run from 0 to 6, so the iterated integral that gives the 
mass of the tetrahedron is 


n 3-(l/2)s /•( 1/3) (6—21—22/) 

/ x + y + zdzdydx. 

Jo 


Evaluating the triple integral gives us 

r-6 r 3-{l/2)x p(l/3)(6—x—2y) 

M = 



1 0 JO JO 

r-6 r 3-(l/2)x 


xz + yz + - 


'0 J 0 

r-6 f 3-{l/2)x 4 5 


x + y + xdzdydx 
(l/3)(6—x—2 y) 

dy dx 


-x — —x" — -xy + -y — -y +2dy dx 


o J o 
r-6 r /i 


18 


9 


'0 


— ^ O < O _L O O 
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( 11 . 6 ) 


Setting up limits on iterated integrals can require considerable geometric intuition. It is im¬ 
portant to not only create carefully labeled figures, but also to think about how we wish to slice 
the solid. Further, note that when we say "we will integrate first with respect to x," by "first" we 
are referring to the innermost integral in the iterated integral. The next activity explores several 
different ways we might set up the integral in the preceding example. 

Activity 11.21. 

There are several other ways we could have set up the integral to give the mass of the tetrahe¬ 
dron in Example 11.4. 

(a) How many different iterated integrals could be set up that are equal to the integral in 
Equation (11.6)? 

(b) Set up an iterated integral, integrating first with respect to z, then x, then y that is 
equivalent to the integral in Equation (11.6). Before you write down the integral, think 
about Figure 11.35, and draw an appropriate two-dimensional image of an important 
projection. 

(c) Set up an iterated integral, integrating first with respect to y, then z, then x that is equiv¬ 
alent to the integral in Equation (11.6). As in (b), think carefully about the geometry 
first. 
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(d) Set up an iterated integral, integrating first with respect to x, then y, then z that is 
equivalent to the integral in Equation (11.6). 


Now that we have begun to understand how to set up iterated triple integrals, we can apply 
them to determine important quantities, such as those found in the next activity. 

Activity 11.22. 

A solid S is bounded below by the square z = 0, —l<x< 1,-1 < y < 1 and above by the 
surface z = 2 — x 2 — y 2 . A picture of the solid is shown in Figure 11.36. 



Figure 11.36: The solid bounded by the surface z = 2 — x 2 — y 2 . 

(a) Set up (but do not evaluate) an iterated integral to find the volume of the solid S. 

(b) Set up (but do not evaluate) iterated integral expressions that will tell us the center of 
mass of S, if the density at point (x, y, z) is S(x, y, z) = x 2 + 1. 

(c) Set up (but do not evaluate) an iterated integral to find the average density on S using 
the density function from part (b). 

(d) Use technology appropriately to evaluate the iterated integrals you determined in (a), 
(b), and (c); does the location you determined for the center of mass make sense? 


<1 


Summary 

• Let / = f(x, y, z) be a continuous function on a box B = [a, b] X [c, d] X [r, s]. The triple integral 
of / over B is defined as 



f(x, y, z) dV 


Um y*jk > z *jk) ■ av > 


AV-s-0- 


i =1 j =1 fc= 1 


where the triple Riemann sum is defined in the usual way The definition of the triple integral 
naturally extends to non-rectangular solid regions S. 
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The triple integral 


f(x, y, z) dV can tell us 


- the volume of the solid S if f(x, y, z) = 1, 

- the mass of the solid S if / represents the density of S at the point (x. y, z). 
Moreover, 


/avG(S) — 


1 


V(S) 



f(x, y, z) dV, 


is the average value of / over S. 


Exercises 

1. Consider the solid S that is bounded by the parabolic cylinder y = x 2 and the planes z = 0 and 
2 = 1 — y as shown in Figure 11.37. 



Figure 11.37: The solid bounded by y = x 2 and the planes z = 0 and z = 1 — y. 

Assume the density of S is given by d(x, y,z) = z 

(a) Set up (but do not evaluate) an iterated integral that represents the mass of S. Integrate 
first with respect to z, then y, then x. A picture of the projection of S onto the .xy-plane 
is shown in Figure 11.38. 

(b) Set up (but do not evaluate) an iterated integral that represents the mass of S. In this 
case, integrate first with respect to y, then z, then x. A picture of the projection of S onto 
the xz-plane is shown in Figure 11.39. 

(c) Set up (but do not evaluate) an iterated integral that represents the mass of S. For this 
integral, integrate first with respect to x, then ;//, then z. A picture of the projection of S 
onto the yz -plane is shown in Figure 11.39. 

(d) Which of these three orders of integration is the most natural to you? Why? 
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Figure 11.38: Projecting S Figure 11.39: Projecting S Figure 11.40: Projecting S 
onto the xy-plane. onto the xz- plane. onto the yz- plane. 


2. This problem asks you to investigate the average value of some different quantities. 

(a) Set up, but do not evaluate, an iterated integral expression whose value is the average 
sum of all real numbers x, y, and 2 that have the following property: y is between 0 and 
2, x is greater than or equal to 0 but cannot exceed 2 y, and z is greater than or equal to 
0 but cannot exceed x + y. 

(b) Set up, but do not evaluate, an integral expression whose value represents the average 
value of f(x, y,z ) = x + y + z over the solid region in the first octant bounded by the 
surface z = 4 — x — y 2 and the coordinate planes x = 0, y = 0, z = 0. 

(c) How are the quantities in (a) and (b) similar? How are they different? 

3. Consider the solid that lies between the paraboloids z = g(x, y) = x 2 + y 2 and z = f(x, y) = 
8 — 3x 2 — 3 y 2 . 

(a) By eliminating the variable z, determine the curve of intersection between the two 
paraboloids, and sketch this curve in the x-y plane. 

(b) Set up, but do not evaluate, an iterated integral expression whose value determine the 
mass of the solid, integrating first with respect to x, then y, then z. Assume the the 
solid's density is given by 5(x, y, z) = x 2 +y2+z 2 +v 

(c) Set up, but do not evaluate, iterated integral expressions whose values determine the 
mass of the solid using all possible remaining orders of integration. Use 6(x,y,z) = 
x 2 +y2+ .. 2 +l as the density of the solid. 

(d) Set up, but do not evaluate, iterated integral expressions whose values determine the 
center of mass of the solid. Again, assume the the solid's density is given by 5(x, y, z) = 

l 

x 2_\_y2_\_ z 2 -\-l * 

(e) Which coordinates of the center of mass can you determine without evaluating any in¬ 
tegral expression? Why? 
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4. In each of the following problems, your task is to 


(i) sketch, by hand, the region over which you integrate 

(ii) set up iterated integral expressions which, when evaluated, will determine the value 
sought 

(iii) use appropriate technology to evaluate each iterated integral expression you develop 

Note well: in some problems you may be able to use a double rather than a triple integral, and 
polar coordinates may be helpful in some cases. 

(a) Consider the solid created by the region enclosed by the circular paraboloid z = 4 — 
x 2 — y 2 over the region R in the x-y plane enclosed by y = — x and the circle x 2 + y 2 = 4 
in the first, second, and fourth quadrants. 

Determine the solid's volume. 

(b) Consider the solid region that lies beneath the circular paraboloid z = 9 — x 2 — y 2 over 
the triangular region between y = x,y = 2x, and y = 1. Assuming that the solid has its 
density at point (x, y, z) given by d(x, y. z) = xyz + 1, measured in grams per cubic cm, 
determine the center of mass of the solid. 


(c) In a certain room in a house, the walls can be thought of as being formed by the lines 
y = 0, y = 12 + x/4, x = 0, and x = 12, where length is measured in feet. In addition, 
the ceiling of the room is vaulted and is determined by the plane z = 16 — x/6 - y/3. 
A heater is stationed in the corner of the room at (0, 0, 0) and causes the temperature in 
the room at a particular time to be given by 


T{x,y,z) 


80 

1 + — + + — 
~ 1000 ~ 1000 ^ 1000 


What is the average temperature in the room? 

(d) Consider the solid enclosed by the cylinder x 2 + y 2 = 9 and the planes y + z = 5 and 
z = 1. Assuming that the solid's density is given by d(x, y. z) = \Jx 2 + y 2 , find the mass 
and center of mass of the solid. 
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11.8 Triple Integrals in Cylindrical and Spherical Coordinates 

Motivating Questions 

In this section, zve strive to understand the ideas generated by the following important questions: 


• What are the cylindrical coordinates of a point, and how are they related to Cartesian coor¬ 
dinates? 

• What is the volume element in cylindrical coordinates? How does this inform us about 
evaluating a triple integral as an iterated integral in cylindrical coordinates? 

• What are the spherical coordinates of a point, and how are they related to Cartesian coordi¬ 
nates? 

• What is the volume element in spherical coordinates? How does this inform us about eval¬ 
uating a triple integral as an iterated integral in spherical coordinates? 


Introduction 

We have encountered two different coordinate systems in M 2 - the rectangular and polar coordi¬ 
nates systems - and seen how in certain situations, polar coordinates form a convenient alterna¬ 
tive. In a similar way, there there turn out to be two additional natural coordinate systems in M 3 . 
Given that we are already familiar with the Cartesian coordinate system for M 3 , we next investigate 
the cylindrical and spherical coordinate systems (each of which builds upon polar coordinates in 
M 2 ). In what follows, we will see how to convert among the different coordinate systems, how to 
evaluate triple integrals using them, and some situations in which these other coordinate systems 
prove advantageous. 

Preview Activity 11.8. In the following questions, we investigate the two new coordinate systems 
that are the subject of this section: cylindrical and spherical coordinates. Our goal is to consider 
some examples of how to convert from rectangular coordinates to each of these systems, and vice 
versa. Triangles and trigonometry prove to be particularly important. 

The cylindrical coordinates of a point in M 3 are given by (r, 6 , z ) where r and 6 are the polar 
coordinates of the point (x, y) and z is the same z coordinate as in Cartesian coordinates. An 
illustration is given in Figure 11.41. 

(a) Find cylindrical coordinates for the point whose Cartesian coordinates are (—1, \/3, 3). Draw 
a labeled picture illustrating all of the coordinates. 

(b) Find the Cartesian coordinates of the point whose cylindrical coordinates are (2, , l). Draw 

a labeled picture illustrating all of the coordinates. 

The spherical coordinates of a point in M 3 are p (rho), 9, and f (phi), where p is the distance from 
the point to the origin, 6 has the same interpretation it does in polar coordinates, and f is the angle 
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Figure 11.41: The cylindrical coordi¬ 
nates of a point. 


Figure 11.42: The spherical coordi¬ 
nates of a point. 


between the positive z axis and the vector from the origin to the point, as illustrated in Figure 11.42. 


For the following questions, consider the point P whose Cartesian coordinates are (—2,2, \/8). 

(c) What is the distance from P to the origin? Your result is the value of p in the spherical 
coordinates of P. 

(d) Determine the point that is the projection of P onto the ary-plane. Then, use this projection to 
find the value of 9 in the polar coordinates of the projection of P that lies in the plane. Your 
result is also the value of 6 for the spherical coordinates of the point. 

(e) Based on the illustration in Figure 11.42, how is the angle 4> determined by p and the z co¬ 
ordinate of PI Use a well-chosen right triangle to find the value of cj), which is the final 
component in the spherical coordinates of P. Draw a carefully labeled picture that clearly 
illustrates the values of p, 9, and cj) in this example, along with the original rectangular coor¬ 
dinates of P. 

(f) Based on your responses to (c), (d), and (e), if we are given the Cartesian coordinates (x, y, z ) 
of a point Q, how are the values of p, 9, and <f> in the spherical coordinates of Q determined 
by x, y, and z? 


CXI 


Cylindrical Coordinates 

As we stated in Preview Activity 11.8, the cylindrical coordinates of a point are (r, 9 , z), where r 
and 9 are the polar coordinates of the point (x, y), and z is the same z coordinate as in Cartesian 
coordinates. The general situation is illustrated Figure 11.41. 
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Since we already know how to convert between rectangular and polar coordinates in the plane, 
and the z coordinate is identical in both Cartesian and cylindrical coordinates, the conversion 
equations between the two systems in M 3 are essentially those we found for polar coordinates: 

x = rcos(0) y = rsin(9) z = z 

r 2 = x 2 + y 2 tan (9) = — z = z. 

x 

Just as with rectangular coordinates, where we usually write z as a function of x and y to 
plot the resulting surface, in cylindrical coordinates, we often express z as a function of r and 
9. In the following activity, we explore several basic equations in cylindrical coordinates and the 
corresponding surface each generates. 

Activity 11.23. 

In this activity, we graph some surfaces using cylindrical coordinates. To improve your intu¬ 
ition and test your understanding, you should first think about what each graph should look 
like before you plot it using technology. 8 

(a) Plot the graph of the cylindrical equation r = 2, where we restrict the values of 9 and 
z to the intervals 0 < 6 < 2-k and 0 < z < 2. What familiar shape does the resulting 
surface take? How does this example suggest that we call these coordinates cylindrical 
coordinates ? 

(b) Plot the graph of the cylindrical equation 6 = 2, where we restrict the other variables to 
the values 0 < r < 2 and 0 < z < 2. What familiar surface results? 

(c) Plot the graph of the cylindrical equation z = 2, using 0 < 6 < 2n and 0 < r < 2. What 
does this surface look like? 

(d) Plot the graph of the cylindrical equation z = r, where 0 < 0 < 27 t and 0 < r < 2. What 
familiar surface results? 

(e) Plot the graph of the cylindrical equation z = 9 for 0 < 9 < 4-7T. What does this surface 
look like? 


<1 

As the name and Activity 11.23 suggest, cylindrical coordinates are useful for describing sur¬ 
faces that are cylindrical in nature. 


Triple Integrals in Cylindrical Coordinates 


To evaluate a triple integral 


f(x, y, z) dV as an iterated integral in Cartesian coordinates, we 


use the fact that the volume element dV is equal to dz dy dx (which corresponds to the volume of a 


8 e.g., http : //www . math . uri . edu / ~bkaskosz/f lashmo/cylin / - to plot r = 2, set r to 2, 6 to s, and z tot- 

to plot 9 = 7r/3, set 6 = 7r/3, r = s, and z = t, for example. Thanks to Barbara Kaskosz of URI and the Flash and Math 
team. 
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small box). To evaluate a triple integral in cylindrical coordinates, we similarly must understand 
the volume element dV in cylindrical coordinates. 

Activity 11.24. 

A picture of a cylindrical box, B = {(r, 6 , z) : r\ < r < r 2 ,G\ < 0 < O 2 , z\ < z < Z 2 }, is shown 
in Figure 11.43. Let Ar = r 2 — r 1 , Ad = 62 — 0\, and Az = Z 2 — z 1 . We want to determine the 
volume AH of B in terms of Ar, AO, Az, r, 0, and z. 


z 




Figure 11.43: A cylindrical box. 


(a) Appropriately label Ar, AO, and Az in Figure 11.43. 

(b) Let A A be the area of the projection of the box, B, onto the .xy-plane, which is shaded 
blue in Figure 11.43. Recall that we previously determined the area A A in polar coordi¬ 
nates in terms of r, Ar, and AO. In light of the fact that we know A/1 and that z is the 
standard z coordinate from Cartesian coordinates, what is the volume AH in cylindrical 
coordinates? 


<1 


Activity 11.24 demonstrates that the volume element dV in cylindrical coordinates is given by 
dV = r dzdr dO, and hence the following rule holds in general. 

Given a continuous function / = f(x, y, z) over a region S in M 3 , 


f(x,y,z)d,V= / // f (r cos(O), r sin(O), z) r dz dr dO. 

s J J Js 

The latter expression is an iterated integral in cylindrical coordinates. 


Of course, to complete the task of writing an iterated integral in cylindrical coordinates, we 
need to determine the limits on the three integrals: 0, r, and z. In the following activity, we explore 
how to do this in several situations where cylindrical coordinates are natural and advantageous. 
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Activity 11.25. 

In each of the following questions, set up, but do not evaluate, the requested integral expres¬ 
sion. 

(a) Let S be the solid bounded above by the graph of z = x 2 + y' 2 and below by 2 = 0 on 
the unit circle. Determine an iterated integral expression in cylindrical coordinates that 
gives the volume of S. 

(b) Suppose the density of the cone defined by r = 1 — z, with £ > 0, is given by 8 {r, 9 , z) = 
z. A picture of the cone is shown in Figure 11.44, and the projection of the cone onto the 
xy -plane in given in Figure 11.45. Set up an iterated integral in cylindrical coordinates 
that gives the mass of the cone. 




Figure 11.44: The cylindrical cone r = Figure 11.45: The projection into the 

1 — z. xy-plane. 


(c) Determine an iterated integral expression in cylindrical coordinates whose value is the 
volume of the solid bounded below by the cone 2 = \Jx 2 + y 2 and above by the cone 
z = 4 — \Jx 1 + y 2 . A picture is shown in Figure 11.46. 

<1 


Spherical Coordinates 

As we saw in Preview Activity 11.8, the spherical coordinates of a point in 3-space have the form 
(p, 9 , 0 ), where p is the distance from the point to the origin, 9 has the same meaning as in polar 
coordinates, and 0 is the angle between the positive z axis and the vector from the origin to the 
point. The overall situation is illustrated in Figure 11.42. 

The example in Preview Activity 11.8 suggests that given a point in rectangular coordinates, 
(x, y, z), we can find the corresponding spherical coordinates. Indeed, this holds generally by 
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Figure 11.46: A solid bounded by the cones 2 = \Jx 1 + y 2 and z = 4 — i/x 2 + y 2 . 



Figure 11.47: Converting from spherical to Cartesian coordinates. 


using the relationships 


/ - y 2 , 

p = \J x 1 + y 2 + z 2 tan(0) = — cos(0) = - 

x p 

where in the latter two equations, we require and p / 0. 

To convert from given spherical coordinates to Cartesian coordinates, we use the equations 

x = psin(</>) cos(0) y = psin(</>) sin(0) z = p cos(cf>), 

as illustrated in Figure 11.47. 

When it comes to thinking about particular surfaces in spherical coordinates, similar to our 
work with cylindrical and Cartesian coordinates, we usually write p as a function of 6 and </>; this 
is a natural analog to polar coordinates, where we often think of our distance from the origin in 
the plane as being a function of 6 . In spherical coordinates, we likewise often view p as a function 
of 9 and <p, thus viewing distance from the origin as a function of two key angles. 
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In the following activity, we explore several basic equations in spherical coordinates and the 
surfaces they generate. 

Activity 11.26. 

In this activity, we graph some surfaces using spherical coordinates. To improve your intuition 
and test your understanding, you should first think about what each graph should look like 
before you plot it using technology. 9 

(a) Plot the graph of p = 1, where 9 and <P are restricted to the intervals ()<()< 2tt 
and 0 < (f> < it. What is the resulting surface? How does this particular example 
demonstrate the reason for the name of this coordinate system? 

(b) Plot the graph of 0 = |, where p and 9 are restricted to the intervals 0 < p < 1 and 
{)<()< 2i r. What familiar surface results? 

(c) Plot the graph of 6 = |, for 0 < p < 1 and 0 < < ir. What familiar shape arises? 

(d) Plot the graph of p = 9, for 0 < cp < tt and 0 < 9 < 2-k. How does the resulting surface 
appear? 


<1 

As the name and Activity 11.26 indicate, spherical coordinates are particularly useful for de¬ 
scribing surfaces that are spherical in nature; they are also convenient for working with certain 
conical surfaces. 


Triple Integrals in Spherical Coordinates 

As with rectangular and cylindrical coordinates, a triple integral JJJ fix. y. z) dV in spherical 
coordinates can be evaluated as an iterated integral once we understand the volume element dV. 

Activity 11.27. 

To find the volume element dV in spherical coordinates, we need to understand how to deter¬ 
mine the volume of a spherical box of the form p\ < p < P2 (with A p = P2 — Pi), 4 >i < 4 > < <t>2 
(with A cj) = 02 — 4>\), and 9\ < 0 < Hi (with A 9 = O 2 — 0\). An illustration of such a box is given 
in Figure 11.48. This spherical box is a bit more complicated than the cylindrical box we en¬ 
countered earlier. In this situation, it is easier to approximate the volume AV than to compute 
it directly. Here we can approximate the volume AH of this spherical box with the volume of a 
Cartesian box whose sides have the lengths of the sides of this spherical box. In other words, 

AV^\PS\ \PR\ \PQ\, 

where \PR\ denotes the length of the circular arc from P to II. 

9 e.g., http : //www .f lashandmath . com/mathlets/multicalc/paramsphere/surf_graph_sphere . 
html - to plot p = 2, set p to 2, 9 to s, and <j> to t, for example. Thanks to Barbara Kaskosz of URI and the Flash and 
Math team. 
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Figure 11.48: A spherical box. 


Figure 11.49: A spherical volume ele¬ 
ment. 


(a) What is the length \PS\ in terms of p? 

(b) What is the length of the arc PR? (Hint: The arc PR is an arc of a circle of radius p 2 , and 
arc length along a circle is the product of the angle measure (in radians) and the circle's 
radius.) 

(c) What is the length of the arc PQ? (Hint: The arc PQ lies on a horizontal circle as 
illustrated in Figure 11.49. What is the radius of this circle?) 

(d) Use your work in (a), (b), and (c) to determine an approximation for AU in spherical 
coordinates. 


<1 

Letting A p, Ad and Ad go to 0, it follows from the final result in Activity 11.27 that 
dV = p 2 sin(d) dp dxb dd in spherical coordinates, and thus allows us to state the following general 
rule. 


Given a continuous function / = f(x, y, z ) over a region S in M 3 , 


f(x,y,z)dV = // / /(psin(^) cos(9), psin(^) sin(0), pcos(^)) p z sin(</>) dpdc^dd. 
S JJJs 


The latter expression is an iterated integral in spherical coordinates. 


Finally, in order to actually evaluate an iterated integral in spherical coordinates, we must of 
course determine the limits of integration in d, <i>, and p. The process is similar to our earlier work 
in the other two coordinate systems. 

Activity 11.28. 

We can use spherical coordinates to help us more easily understand some natural geometric 
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objects. 

(a) Recall that the sphere of radius a has spherical equation p = a. Set up and evaluate an 
iterated integral in spherical coordinates to determine the volume of a sphere of radius 
a. 

(b) Set up, but do not evaluate, an iterated integral expression in spherical coordinates 
whose value is the mass of the solid obtained by removing the cone (j> = | from the 
sphere p = 2 if the density 5 at the point ( x,y,z ) is 5(x,y,z ) = \Jx' 2 + y' 2 + z 2 . An 
illustration of the solid is shown in Figure 11.50. 



Figure 11.50: The solid cut from the sphere p = 2 by the cone L = |. 


<1 


Summary 

• The cylindrical coordinates of a point P are (r, 0, z ) where r is the distance from the origin to 
the projection of P onto the xy-plane, 6 is the angle that the projection of P onto the xy-plane 
makes with the positive x-axis, and z is the vertical distance from P to the projection of P 
onto the xy-plane. When P has rectangular coordinates (x, y, z), it follows that its cylindrical 
coordinates are given by 

r 2 = x 2 + y 2 , tan (6) = -, z = z. 

x 

When P has given cylindrical coordinates (r, 9 , z), its rectangular coordinates are 

x = rcos(9), y = rsin(9), z = z. 


The volume element dV in cylindrical coordinates is dV = r dzdr dO. Hence, a triple integral 
f(x, y, z) dA can be evaluated as the iterated integral 


f(r cos(0), r sin(0), z) r dz dr d6. 
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The spherical coordinates of a point P in 3-space are p (rho), q> (phi), and 0, where p is the 
distance from P to the origin, 0 is the angle between the positive z axis and the vector from the 
origin to P, and 0 is the angle that the projection of P onto the xy-plane makes with the positive 
x-axis. When P has Cartesian coordinates (x. y. z), the spherical coordinates are given by 


2 2 , 2,2 
p = x +y + z , 


tan(0) = —, cos ((f)) = 

x p 


Given the point P in spherical coordinates (p, ©. 9), its rectangular coordinates are 

x = psin((j)) cos(9), y = psin(^>) sin(0), z = pcos(<f>). 


The volume element dV in spherical coordinates is dV = p 2 sin (<f>) dpd<j>dO. Thus, a triple inte 


gral 


f(x, y, z) dA can be evaluated as the iterated integral 


f(psin((j)) cos(0), psin(0) sin(0), pcos(0)) p 2 sin(</>) dp d<f> dd. 


Exercises 

1. In each of the following questions, set up an iterated integral expression whose value deter¬ 
mines the desired result. Then, evaluate the integral first by hand, and then using appropriate 
technology 

(a) Find the volume of the "cap" cut from the solid sphere x 2 + y 2 + z 2 = 4 by the plane 
^ = 1, as well as the z-coordinate of its centroid. 

(b) Find the x-coordinate of the center of mass of the portion of the unit sphere that lies in 
the first octant (i.e., where x, y, and z are all nonnegative). Assume that the density of 
the solid given by S(x, y, z) = 1+x ^ y2+z2 . 

(c) Find the volume of the solid bounded below by the x-y plane, on the sides by the sphere 
p = 2, and above by the cone 4> = 7t/3. 

(d) Find the z coordinate of the center of mass of the region that is bounded above by the 

surface 2 = \Jsjx 2 +~y^, on the sides by the cylinder x 2 + y 2 = 4, and below by the x-y 
plane. Assume that the density of the solid is uniform and constant. 

(e) Find the volume of the solid that lies outside the sphere x 2 + y 2 + z 2 = 1 and inside the 
sphere x 2 + y 2 + z 2 = 2 z. 

2. For each of the following questions, (a) sketch the region of integration, (b) change the coor¬ 
dinate system in which the iterated integral is written to one of the remaining two, and (c) 
evaluate the iterated integral you deem easiest to evaluate by hand. 

n y/l— X 2 i‘y/2—x 2 — y 2 

/ _ xy dz dy dx 

J\J x 2 +y 2 
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(b) f ' f f p 2 sin(0)dpc^d0 

Jo Jo Jo 

/*27r rl rl 

(c) / / / r 2 cos(9) dzdr dO 

Jo Jo Jr 

3. Consider the solid region S bounded above by the paraboloid z = 16 -x 2 -y 2 and below by 
the paraboloid z = 3x 2 + 3 y 2 . 

(a) Describe parametrically the curve in M 3 in which these two surfaces intersect. 

(b) In terms of x and y, write an equation to describe the projection of the curve onto the 
x-y plane. 

(c) What coordinate system do you think is most natural for an iterated integral that gives 
the volume of the solid? 

(d) Set up, but do not evaluate, an iterated integral expression whose value is average z- 
value of points in the solid region S. 

(e) Use technology to plot the two surfaces and evaluate the integral in (c). Write at least 
one sentence to discuss how your computations align with your intuition about where 
the average z-value of the solid should fall. 
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11.9 Change of Variables 

Motivating Questions 

In this section, we strive to understand the ideas generated by the following important questions: 

• What is a change of variables? 

• What is the Jacobian, and how is it related to a change of variables? 


Introduction 


In single variable calculus, we encountered the idea of a change of variable in a definite integral 
through the method of substitution. For example, given the definite integral 

2 

2x(x 2 + l) 3 dx, 

we naturally consider the change of variable u = x 2 + 1. From this substitution, it follows that 
du = 2.x dx, and since x = 0 implies u = 1 and x = 2 implies u = 5, we have transformed the 
original integral in x into a new integral in u. In particular. 



2x(x 2 + l) 3 dx 



u 3 du. 


The latter integral, of course, is far easier to evaluate. 

Through our work with polar, cylindrical, and spherical coordinates, we have already implic¬ 
itly seen some of the issues that arise in using a change of variables with two or three variables 
present. In what follows, we seek to understand the general ideas behind any change of variables 
in a multiple integral. 

Preview Activity 11.9. Consider the double integral 


I = 



+ y 2 dA, 


where D is the upper half of the unit disk. 


(11.7) 


(a) Write the double integral I given in Equation (11.7) as an iterated integral in rectangular 
coordinates. 

(b) Write the double integral I given in Equation (11.7) as an iterated integral in polar coordi¬ 
nates. 
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When we write the double integral (11.7) as an iterated integral in polar coordinates we make a 
change of variables, namely 


x = rcos(0) and y = rsin(9). (11.8) 

We also then have to change dA to r dr d9. This process also identifies a "polar rectangle" [ri, 7 - 2 ] X 
\0 ]. 0 2 ] with the original Cartesian rectangle, under the transformation 10 in Equation (11.8). The 
vertices of the polar rectangle are transformed into the vertices of a closed and bounded region in 
rectangular coordinates. 

To work with a numerical example, let's now consider the polar rectangle P given by [1, 2] x [|, |], 
so that r\ = 1, r 2 = 2, 9\ = and 9-2 = \. 


(c) Use the transformation determined by the equations in (11.8) to find the rectangular vertices 
that correspond to the polar vertices in the polar rectangle P. In other words, by substi¬ 
tuting appropriate values of r and 9 into the two equations in (11.8), find the values of the 
corresponding x and y coordinates for the vertices of the polar rectangle P. Label the point 
that corresponds to the polar vertex (r±,9i) as (x \, y \), the point corresponding to the polar 
vertex (7*2, 9 \) as (X 2 , 2 / 2 )/ the point corresponding to the polar vertex (n, 62 ) as (x^, 7/3), and 
the point corresponding to the polar vertex ( 7 - 2 , # 2 ) as (X4, 7/4). 

(d) Draw a picture of the figure in rectangular coordinates that has the points (xi, 7/1), (.i'2- IJ 2 ), 
(X3,7/3), and (.x'4,7/4) as vertices. (Note carefully that because of the trigonometric functions 
in the transformation, this region will not look like a Cartesian rectangle.) What is the area 
of this region in rectangular coordinates? How does this area compare to the area of the 
original polar rectangle? 


CXI 


Change of Variables in Polar Coordinates 

The general idea behind a change of variables is suggested by Preview Activity 11.9. There, we 
saw that in a change of variables from rectangular coordinates to polar coordinates, a polar rect¬ 
angle [ri, 7 - 2 ] x [0i, 62 ] gets mapped to a Cartesian rectangle under the transformation 

x = rcos(9) and y = rsm(9). 

The vertices of the polar rectangle P are transformed into the vertices of a closed and bounded 
region P' in rectangular coordinates. If we view the standard coordinate system as having the 
horizontal axis represent r and the vertical axis represent 9, then the polar rectangle P appears to 

10 A transformation is another name for function: here, the equations x = r cos(9) and y = r sin (6) define a function 
T by T(r, 6) = (r cos (6), r sin(#)) so that T is a function (transformation) from R 2 to R 2 . We view this transformation 
as mapping a version of the x-y plane where the axes are viewed as representing r and 8 (the r-6 plane) to the familiar 
x-y plane. 
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us at left in Figure 11.51. The image P' of the polar rectangle P under the transformation given 
by (11.8) is shown at right in Figure 11.51. We thus see that there is a correspondence between a 
simple region (a traditional, right-angled rectangle) and a more complicated region (a fraction of 
an annulus) under the function T given by T(r, 6 ) = ( r cos (9),r sin(0)). 



e 




















p 













r 





‘ 




Figure 11.51: A rectangle P and its image P'. 

Furthermore, as Preview Activity 11.9 suggest, it follows generally that for an original polar 
rectangle P = [n, r?\ x [0\. 62 ], the area of the transformed rectangle P' is given by r2 + ri ArAd. 
Therefore, as Ar and A 6 go to 0 this area becomes the familiar area element dA = r dr d 6 in polar 
coordinates. When we proceed to working with other transformations for different changes in 
coordinates, we have to understand how the transformation affects area so that we may use the 
correct area element in the new system of variables. 


General Change of Coordinates 

We first focus on double integrals. As with single integrals, we may be able to simplify a double 
integral of the form 

JJ f (x,y) dA 

by making a change of variables (that is, a substitution) of the form 

x = x(s,t) and y = y(s,t ) 

where x and y are functions of new variables s and t. This transformation introduces a correspon¬ 
dence between a problem in the xy-plane and one in the the sf-plane. The equations x = x(s, t ) 
and y = y(s, t ) convert s and t to x and y; we call these formulas the change of variable formulas. To 
complete the change to the new s, t variables, we need to understand the area element, dA, in this 
new system. The following activity helps to illustrate the idea. 

Activity 11.29. 

Consider the change of variables 

x = s + 2 t and y = 2 s + \ft. 
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Let's see what happens to the rectangle T = [0,1] x [1,4] in the st-plane under this change of 
variable. 

(a) Draw a labeled picture of T in the st-plane. 

(b) Find the image of the st-vertex (0,1) in the .xy-plane. Likewise, find the respective 
images of the other three vertices of the rectangle T: (0,4), (1,1), and (1,4). 

(c) In the xy-plane, draw a labeled picture of the image, T', of the original .si-rectangle T. 
What appears to be the shape of the image, T'7 

(d) To transform an integral with a change of variables, we need to determine the area 
element dA for image of the transformed rectangle. How would we find the area of the 
.xy-figure T'l (Hint: Remember what the cross product of two vectors tells us.) 


<1 

Activity 11.29 presents the general idea of how a change of variables works. We partition 
a rectangular domain in the st system into subrectangles. Let T = [a, b] x [a + As, b + At] be 
one of these subrectangles. Then we transform this into a region T' in the standard xy Cartesian 
coordinate system. The region T' is called the image of T; the region T is the pre-image of V. 
Although the sides of this xy region T' aren't necessarily straight (linear), we will approximate the 
element of area dA for this region with the area of the parallelogram whose sides are given by the 
vectors v and w, where v is the vector from (x(a, 6), y(a, b)) to (x(a + As, b),y(a + As, b)), and w 
is the vector from (x(a, 6), y(a, b)) to (x(a, b + At), y(a , b + At)). 

An example of an image T' in the xy plane that results from a transformation of a rectangle T 
in the st plane is shown in Figure 11.52. 
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Figure 11.52: Approximating an area of an image resulting from a transformation. 

The components of the vector v are 

v = (x(a + As, b ) — x(a, b),y(a + As, b) — y(a, b), 0) 
and similarly those for w are 

w = {x(a, b + At) — x(a, b),y(a , b + As) — y{a , b), 0) . 
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Slightly rewriting v and w, we have 


v = 


w = 


/ x(a + As,b) - x(a,b) r/(a + As,6) - y(a,b ) \ ^ ^ 

\ As ’ As / 

/ x(a,b + At) - x{a,b) y(a,b + As) - y(a,b ) \ 

\ At ’ At ’ / 


For small As and At, the definition of the partial derivative tells us that 

V ~ (Is^’ 6 ' 1, ^s^ a, ^’°) AS and W ~ At ' 

Recall that the area of the parallelogram with sides v and w is the length of the cross product of 
the two vectors, |v x w|. From this, we observe that 


V X w 


As x (^ a, ^’^ a, ^’°) At 

(°, 0, ^ (a, 6) ^ (a, 6)-^ (a, 6)^ (a, 6)^ As At. 


Finally, by computing the magnitude of the cross product, we see that 


V X w 


(a, 6) (a, 6) - ^ (a, 6) ^ (a, b) ) As At 


g(a,6)g(a,6)-^(a,6)g(a,(.) 


As At. 


Therefore, as the number of subdivisions increases without bound in each direction. As and 
At both go to zero, and we have 


dA 


dx dy 

dx dy 

ds dt 

dt ds 


ds dt. 


(11.9) 


Equation (11.9) hence determines the general change of variable formula in a double integral, 
and we can now say that 



f(x,y) dA 



f(x,y) dy dx 



f{x{s,t),y{s,t)) 


dx dy 
ds dt 


dx dy 
dt ds 


ds dt. 


The quantity 


dx dy dx dy 
ds dt dt ds 
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is called the Jacobian, and we denote the Jacobian using the shorthand notation 


d{x, y) 


dx dy 

dx dy 

d(s,t ) 


ds dt 

dt ds 


To summarize, the preceding change of variable formula that we have derived now follows. 


Change of Variables in a Double Integral. Suppose a change of variables x = x(s, t ) and y = 
y(s , f) transforms a closed and bounded region R in the si-plane into a closed and bounded 
region R' in the ary-plane. Under modest conditions (that are studied in advanced calculus), it 
follows that 



f(x,y)dA 



f(x(s,t),y(s,t)) 


d(x,y) 

d(s,t) 


d-s dt. 


Activity 11.30. 

Find the Jacobian when changing from rectangular to polar coordinates. That is, for the trans¬ 
formation given by x = r cos (8), y = r sin(0), determine a simplified expression for the quantity 


dx dy 

dx dy 

dr d6 

d6 dr 


What do you observe about your result? How is this connected to our earlier work with double 
integrals in polar coordinates? 

<1 


Given a particular double integral, it is natural to ask, "how can we find a useful change of 
variables?" There are two general factors to consider: if the integrand is particularly difficult, 
we might choose a change of variables that would make the integrand easier; or, given a compli¬ 
cated region of integration, we might choose a change of variables that transforms the region of 
integration into one that has a simpler form. These ideas are illustrated in the next activities. 

Activity 11.31. 

Consider the problem of finding the area of the region D' defined by the ellipse x 2 + \ = 1. 
Here we will make a change of variables so that the pre-image of the domain is a circle. 


(a) Let x(s, t) = s and y{s. t ) = 2 1. Explain why the pre-image of the original ellipse (which 
lies in the xy plane) is the circle s 2 + t 2 = 1 in the si-plane. 


11 If you are familiar with determinants of matrices, we can can also represent 
matrix 


9(x,y) 

d(s,t) 


as the determinant of a 2 x 2 


d(x,y) 
d(s, t) 


dx dx 
ds dt 

dy dy 
ds dt 
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where D is the disk bounded by the circle s 2 + t 2 = 1. In particular, explain the source 


of the "2" in the st integral. 

(c) Without evaluating any of the integrals present, explain why the area of the original 
elliptical region D' is 2 tt. 


<1 


Activity 11.32. 

Let D' be the region in the xy-plane bounded by the lines y = 0, x = 0, and x + y = 1. We will 
evaluate the double integral 



( 11 . 10 ) 


with a change of variables. 


(a) Sketch the region D' in the xy plane. 

(b) We would like to make a substitution that makes the integrand easier to antidifferenti¬ 
ate. Let s = x + y and t = x — y. Explain why this should make antidifferentiation easier 
by making the corresponding substitutions and writing the new integrand in terms of 
s and t. 

(c) Solve the equations s = x + y and t = x — y for x and y. (Doing so determines the 
standard form of the transformation, since we will have x as a function of s and t, and 
y as a function of s and t.) 

(d) To actually execute this change of variables, we need to know the sf-region D that cor¬ 
responds to the xy-regi on I)'. 

i. What st equation corresponds to the xy equation x + y = 1? 

ii. What st equation corresponds to the xy equation x = 0? 

iii. What st equation corresponds to the xy equation y = 0? 

iv. Sketch the st region D that corresponds to the xy domain I)'. 

(e) Make the change of variables indicated by s = x + y and t = x — y in the double integral 
(11.10) and set up an iterated integral in st variables whose value is the original given 
double integral. Finally, evaluate the iterated integral. 


<1 
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Change of Variables in a Triple Integral 


Given a function / = f{x , y, z ) over a region S' in M 3 , similar arguments can be used to show that 
a change of variables x = x(s, t, u), y = y(s, t, u ), and z = z(s, t, u) in a triple integral results in the 
equality 


JJJ f{x,y,z)dV 

where 


In expanded form. 


d(x,y,z) 
d(s, t, u) 


),y(s, 

,t,u) 

,z(s 

dx 

dx 

dx 

ds 

~dt 

du 

dy 

dy 

dy 

ds 

dt 

du 

dz 

dz 

dz 

ds 

dt 

du 


d{x , y, z) 
d(s, t, u ) 


ds dt du , 


d(x,y,z) 

dx 

dy dz 

dy dz 

dx 

dy dz 

dy dz 

dx 

dy dz 

dy dz 

d(s, t, u) 

ds 

dt du 

du dt 

dt 

ds du 

du ds 

du 

ds dt 

dt ds 


The expression 


d{x,y,z) 

d(s,t,u) 


is again called the Jacobian. 


Summary 

• If an integral is described in terms of one set of variables, we may write that set of variables in 
terms of another set of the same number of variables. If the new variables are chosen appropri¬ 
ately, the transformed integral may be easier to evaluate. 

• The Jacobian is a scalar function that relates the area or volume element in one coordinate 
system to the corresponding element in a new system determined by a change of variables. 


Exercises 

1. Let D' be the region in the xy plane that is the parallelogram with vertices (3, 3), (4, 5), (5,4), 

and (6, 6). 

(a) Sketch and label the region D' in the xy plane. 

(b) Consider the integral f f D , ( x + y) dA. Explain why this integral would be difficult to set 
up as an iterated integral. 

(c) Let a change of variables be given by x = 2u + v, y = u + 2v. Using substitution or 
elimination, solve this system of equations for u and v in terms of x and y. 
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(d) Use your work in (c) to find the pre-image, D, which lies in the uv plane, of the orig¬ 
inally given region D', which lies in the xy plane. For instance, what uv point corre¬ 
sponds to (3, 3) in the xy plane? 

(e) Use the change of variables in (c) and your other work to write a new iterated integral 
in u and v that is equivalent to the original xy integral f f D ,(x + y) cLA. 

(f) Finally, evaluate the uv integral, and write a sentence to explain why the change of 
variables made the integration easier. 

2. Consider the change of variables 

x(p, 9, 4 >) = psin((/>) cos(0) y(p, 9, 4 >) = psin(</>) sin(0) z(p, 9 , cj>) = pcos(</>), 

which is the transformation from spherical coordinates to rectangular coordinates. Determine 
the Jacobian of the transformation. How is the result connected to our earlier work with iterated 
integrals in spherical coordinates? 

3. In this problem, our goal is to find the volume of the ellipsoid ^ + K- = 1. 

(a) Set up an iterated integral in rectangular coordinates whose value is the volume of the 
ellipsoid. Do so by using symmetry and taking 8 times the volume of the ellipsoid in 
the first octant where x, y, and z are all nonnegative. 

(b) Explain why it makes sense to use the substitution x = as, y = bt, and z = cu in order 
to make the region of integration simpler. 

(c) Compute the Jacobian of the transformation given in (b). 

(d) Execute the given change of variables and set up the corresponding new iterated inte¬ 
gral in s, t, and u. 

(e) Explain why this new integral is better, but is still difficult to evaluate. What additional 
change of variables would make the resulting integral easier to evaluate? 

(f) Convert the integral from (d) to a new integral in spherical coordinates. 

(g) Finally, evaluate the iterated integral in (f) and hence determine the volume of the ellip¬ 
soid. 
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double integral over a general region, 201 
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function 

differentiable, 130 
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of two variables, 3 
surface, 4 
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definition, 62 

iterated integral 

cylindrical coordinates, 249 
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Jacobian, 262, 264 

joint probability density function, 214 

Lagrange multiplier, 177 

level curve, 12 

line 

direction vector, 51 
in space, 51 

parametric equations, 54 
vector equation, 52 
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partial derivatives 
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sum, parallelogram, 25 
vector-valued function 
antiderivative, 73 
definition, 62 
derivative, 67 
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